An integrated approach to parallel scheduling using gang-scheduling, backfilling, and migration

Yanyong Zhang, Hubertus Franke, Jose Moreira, Anand Sivasubramaniam

Research output: Contribution to journal › Article › peer-review

97 Scopus citations

Abstract

Effective scheduling strategies to improve response times, throughput, and utilization are an important consideration in large supercomputing environments. Parallel machines in these environments have traditionally used space-sharing strategies to accommodate multiple jobs at the same time by dedicating the nodes to a single job until it completes. This approach, however, can result in low system utilization and large job wait times. This paper discusses three techniques that can be used beyond simple space-sharing to improve the performance of large parallel systems. The first technique we analyze is backfilling, the second is gang-scheduling, and the third is migration. The main contribution of this paper is an analysis of the effects of combining the above techniques. Using extensive simulations based on detailed models of realistic workloads, the benefits of combining the various techniques are shown over a spectrum of performance criteria.

Original language: English (US)
Pages (from-to): 236-247
Number of pages: 12
Journal: IEEE Transactions on Parallel and Distributed Systems
Volume: 14
Issue number: 3
State: Published - Mar 2003

All Science Journal Classification (ASJC) codes

  • Signal Processing
  • Hardware and Architecture
  • Computational Theory and Mathematics
