A hybrid cache and prefetch mechanism for scientific literature search engines

Huajing Li, Wang Chien Lee, Anand Sivasubramaniam, C. Lee Giles

Research output: Chapter in Book/Report/Conference proceedingConference contribution

6 Scopus citations

Abstract

CiteSeer, a scientific literature search engine that focuses on documents in the computer science and information science domains, suffers from scalability issue on the number of requests and the size of indexed documents, which increased dramatically over the years. CiteSeerχ is an effort to re-architect the search engine. In this paper, we present our initial design of a framework for caching query results, indices, and documents. This design is based on analysis of logged workload in CiteSeer. Our experiments based on mock client requests that simulate actual user behaviors confirm that our approach works well in enhancing system performances.

Original languageEnglish (US)
Title of host publicationWeb Engineering - 7th International Conference, ICWE 2007, Proceedings
PublisherSpringer Verlag
Pages121-136
Number of pages16
ISBN (Print)3540735968, 9783540735960
DOIs
StatePublished - 2007
Event7th International Conference on Web Engineering, ICWE 2007 - Como, Italy
Duration: Jul 16 2007Jul 20 2007

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume4607 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other7th International Conference on Web Engineering, ICWE 2007
Country/TerritoryItaly
CityComo
Period7/16/077/20/07

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'A hybrid cache and prefetch mechanism for scientific literature search engines'. Together they form a unique fingerprint.

Cite this