Cloud computing: A digital libraries perspective

Pradeep Teregowda, Bhuvan Urgaonkar, C. Lee Giles

Research output: Chapter in Book/Report/Conference proceedingConference contribution

31 Scopus citations

Abstract

Provisioning and maintenance of infrastructure for Web based digital library search engines such as CiteSeerx present several challenges. CiteSeerx provides autonomous citation indexing, full text indexing, and extensive document metadata from documents crawled from the web across computer and information sciences and related fields. Infrastructure virtualization and cloud computing are particularly attractive choices for CiteSeerx, which is challenged by both growth in the size of the indexed document collection, new features and most prominently usage. In this paper, we discuss constraints and choices faced by information retrieval systems like CiteSeerx by exploring in detail aspects of placing CiteSeerx into current cloud infrastructure offerings. We also implement an ad-hoc virtualized storage system for experimenting with adoption of cloud infrastructure services. Our results show that a cloud implementation of CiteSeerx may be a feasible alternative for its continued operation and growth.

Original languageEnglish (US)
Title of host publicationProceedings - 2010 IEEE 3rd International Conference on Cloud Computing, CLOUD 2010
Pages115-122
Number of pages8
DOIs
StatePublished - 2010
Event3rd IEEE International Conference on Cloud Computing, CLOUD 2010 - Miami, FL, United States
Duration: Jul 5 2010Jul 10 2010

Publication series

NameProceedings - 2010 IEEE 3rd International Conference on Cloud Computing, CLOUD 2010

Other

Other3rd IEEE International Conference on Cloud Computing, CLOUD 2010
Country/TerritoryUnited States
CityMiami, FL
Period7/5/107/10/10

All Science Journal Classification (ASJC) codes

  • Computational Theory and Mathematics
  • Theoretical Computer Science

Fingerprint

Dive into the research topics of 'Cloud computing: A digital libraries perspective'. Together they form a unique fingerprint.

Cite this