Crash Fault Detection in Celerating Environments Conference Paper

Overview
Identity
Additional Document Info
Other
View All

abstract

Failure detectors are a service that provides (approximate) information about process crashes in a distributed system. The well-known "eventually perfect" failure detector, P, has been implemented in partially synchronous systems with unknown upper bounds on message delay and relative process speeds. However, previous implementations have overlooked an important subtlety with respect to measuring the passage of time in "celerating" environments, in which absolute process speeds can continually increase or decrease while maintaining bounds on relative process speeds. Existing implementations either use action clocks, which fail in accelerating environments, or use real-time clocks, which fail in decelerating environments. We propose the use of bichronal clocks, which are a composition of action clocks and real-time clocks. Our solution can be readily adopted to make existing implementations of P robust to process celeration, which can result from hardware upgrades, server overloads, denial-of-service attacks, and other system volatilities. 2009 IEEE.

name of conference

2009 IEEE International Symposium on Parallel & Distributed Processing

authors

Welch, Jennifer

published proceedings

2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-5

author list (cited authors)

Sastry, S., Pike, S. M., & Welch, J. L.

citation count

4

complete list of authors

Sastry, Srikanth||Pike, Scott M||Welch, Jennifer L

publication date

January 2009

publisher

Institute of Electrical and Electronics Engineers (IEEE) Publisher

published in

Proceedings of the International Parallel and Distributed Processing Symposium, IPDPS Journal