Implementation of recoverable distributed shared memory by logging writes Conference Paper uri icon

abstract

  • Distributed shared memory, by avoiding the programming complexities of message passing, has become a convenient model to work with. But the benefits given by these systems can possibly be achieved only if the whole system behaves like a failure-free system. Many algorithms that have been proposed for implementing a reliable DSM require the processes to take checkpoints whenever there is a data transfer, thus resulting in a heavy overhead during failure-free execution. We present an algorithm to provide recoverable DSM for sequential consistency where the checkpoint interval can be tailored to balance the cost of checkpointing versus the savings in recovery obtained by taking checkpoints often. Unlike previous recovery techniques that use logging, both the logging and the message overheads are reduced. It can tolerate up to n faults, where n is the number of processes, and can be used in an environment where the cost of synchronizing the checkpoints is substantially high.

name of conference

  • Proceedings of 16th International Conference on Distributed Computing Systems

published proceedings

  • PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS

author list (cited authors)

  • Kanthadai, S., & Welch, J. L.

citation count

  • 6

complete list of authors

  • Kanthadai, S||Welch, JL

publication date

  • January 1996