Abstract
Modern distributed systems are often multithreaded and object-oriented in design. They need efficient techniques for checkpointing and restoring their state in order to improve fault tolerance. Traditional process-based distributed checkpointing and rollback algorithms suffer from the problem of false dependencies, which makes them rigid and inefficient for use with modern systems. In this paper, we develop protocols that can selectively checkpoint (and roll back) some threads of a distributed system while leaving others untouched, yet still ensure that the state resulting from such a partial rollback is consistent.
Original language | English (US) |
---|---|
Pages | 39-46 |
Number of pages | 8 |
State | Published - 2001 |
Event | 21st IEEE International Conference on Distributed Computing Systems - Mesa, AZ, United States; Duration: Apr 16 2001 → Apr 19 2001 |
Other
Other | 21st IEEE International Conference on Distributed Computing Systems |
---|---|
Country/Territory | United States |
City | Mesa, AZ |
Period | 4/16/01 → 4/19/01 |
All Science Journal Classification (ASJC) codes
- Software
- Hardware and Architecture
- Computer Networks and Communications