TY - GEN
T1 - CiteSeerχ - A scalable autonomous scientific digital library
AU - Li, Huajing
AU - Councill, Isaac G.
AU - Bolelli, Levent
AU - Zhou, Ding
AU - Song, Yang
AU - Lee, Wang Chien
AU - Sivasubramaniam, Anand
AU - Lee Giles, C.
PY - 2006
Y1 - 2006
N2 - CiteSeer is a scientific literature digital library and search engine which automatically crawls and indexes scientific documents in the fields of computer and information science. Since it's inception in 1997 CiteSeer has grown to index over 730,000 documents and serves over 800,000 requests daily, pushing the limits of the current system's capabilities. In addition, CiteSeer's monolithic architecture inconveniences system maintenance and reduces the flexibility of the system in terms of new feature development, algorithm updates, and system interoperability. In this paper, we discuss the problems of the current CiteSeer architecture and propose a new architecture for a next generation CiteSeer application. The new architecture is based on modular web services and pluggable service components. Preliminary results based on a prototype system show the new architecture enhances flexibility, scalability, and performance for CiteSeer. In addition, new services in development for the next generation CiteSeer system are discussed.
AB - CiteSeer is a scientific literature digital library and search engine which automatically crawls and indexes scientific documents in the fields of computer and information science. Since it's inception in 1997 CiteSeer has grown to index over 730,000 documents and serves over 800,000 requests daily, pushing the limits of the current system's capabilities. In addition, CiteSeer's monolithic architecture inconveniences system maintenance and reduces the flexibility of the system in terms of new feature development, algorithm updates, and system interoperability. In this paper, we discuss the problems of the current CiteSeer architecture and propose a new architecture for a next generation CiteSeer application. The new architecture is based on modular web services and pluggable service components. Preliminary results based on a prototype system show the new architecture enhances flexibility, scalability, and performance for CiteSeer. In addition, new services in development for the next generation CiteSeer system are discussed.
UR - http://www.scopus.com/inward/record.url?scp=34547294914&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=34547294914&partnerID=8YFLogxK
U2 - 10.1145/1146847.1146865
DO - 10.1145/1146847.1146865
M3 - Conference contribution
AN - SCOPUS:34547294914
SN - 1595934286
SN - 9781595934284
T3 - ACM International Conference Proceeding Series
BT - Proceedings of the 1st International Conference on Scalable Information Systems, InfoScale '06
T2 - 1st International Conference on Scalable Information Systems, InfoScale '06
Y2 - 30 May 2006 through 1 June 2006
ER -