CiteSeerx: An architecture and Web service design for an academic document search engine

Huajing Li, Isaac Councill, Wang Chien Lee, C. Lee Giles

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

55 Scopus citations

Abstract

CiteSeer is a scientific literature digital library and search engine that automatically crawls and indexes scientific documents in the field of computer and information science. After serving as a public search engine for nearly ten years, CiteSeer is starting to have scaling problems in handling more documents, adding new features, and serving more users. Its monolithic architecture prevents it from making effective use of new web technologies and providing new services. After analyzing the problems of the current system, we propose a new architecture and data model, CiteSeerx, that will overcome the existing problems and provide scalability, better performance, new services, and new system features.

Original language: English (US)
Title of host publication: Proceedings of the 15th International Conference on World Wide Web
Pages: 883-884
Number of pages: 2
State: Published - 2006
Event: 15th International Conference on World Wide Web - Edinburgh, Scotland, United Kingdom
Duration: May 23 2006 → May 26 2006

Publication series

Name: Proceedings of the 15th International Conference on World Wide Web

Other

Other: 15th International Conference on World Wide Web
Country/Territory: United Kingdom
City: Edinburgh, Scotland
Period: 5/23/06 → 5/26/06

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Software
