Selective checkpointing and rollbacks in multithreaded distributed systems

M. Kasbekar, Chitaranjan Das

Research output: Contribution to conferencePaperpeer-review

9 Scopus citations

Abstract

Modern distributed systems are often multithreaded and object-oriented in their design. They require efficient techniques to checkpoint and restore their state for improving fault-tolerance properties. The traditional process-based techniques of distributed checkpointing and rollback algorithms suffer from the problem of false dependencies, which makes them very rigid and inefficient for use with modern systems. In this paper, we develop protocols that can selectively checkpoint (and rollback) some threads of a distributed system, while leaving others untouched and yet ensuring the consistency of state resulting from such a partial rollback.

Original languageEnglish (US)
Pages39-46
Number of pages8
StatePublished - 2001
Event21st IEEE International Conference on Distributed Computing Systems - Mesa, AZ, United States
Duration: Apr 16 2001Apr 19 2001

Other

Other21st IEEE International Conference on Distributed Computing Systems
Country/TerritoryUnited States
CityMesa, AZ
Period4/16/014/19/01

All Science Journal Classification (ASJC) codes

  • Software
  • Hardware and Architecture
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Selective checkpointing and rollbacks in multithreaded distributed systems'. Together they form a unique fingerprint.

Cite this