Discretionary caching for I/O on clusters

Murali Vilayannur, Anand Sivasubramaniam, Mahmut Kandemir, Rajeev Thakur, Robert Ross

Research output: Chapter in Book/Report/Conference proceedingConference contribution

16 Scopus citations

Abstract

I/O bottlenecks are already a problem in many largescale applications that manipulate huge datasets. This problem is expected to get worse as applications get larger, and the I/O subsystem performance lags behind processor and memory speed improvements. Caching I/O blocks is one effective way of alleviating disk latencies, and there can be multiple levels of caching on a cluster of workstations. Previous studies have shown the benefits of caching whether it be local to a particular node, or a shared global cache across the cluster - for certain applications. However, we show that while caching is useful in some situations, it can hurt performance if we are not careful about what to cache and when to bypass the cache. This paper presents compilation techniques and runtime support to address this problem. These techniques are implemented and evaluated on an experimental Linux/Pentium cluster running a parallel file system. Our results using a diverse set of applications (scientific and commercial) demonstrate the benefits of a discretionary approach to caching for I/O subsystems on clusters, providing as much as 33% savings over indiscriminately caching everything in some applications.

Original languageEnglish (US)
Title of host publicationProceedings - CCGrid 2003
Subtitle of host publication3rd IEEE/ACM International Symposium on Cluster Computing and the Grid
Pages96-103
Number of pages8
DOIs
StatePublished - 2003
Event3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, CCGrid 2003 - Tokyo, Japan
Duration: May 12 2003May 15 2003

Publication series

NameProceedings - CCGrid 2003: 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid

Other

Other3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, CCGrid 2003
Country/TerritoryJapan
CityTokyo
Period5/12/035/15/03

All Science Journal Classification (ASJC) codes

  • Computational Theory and Mathematics
  • Computer Science Applications
  • Software
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Discretionary caching for I/O on clusters'. Together they form a unique fingerprint.

Cite this