Introspective failure analysis: avoiding correlated failures in peer-to-peer systems
Weatherspoon, H.
Moscovitz, T.
Kubiatowicz, J.
Div. of Comput. Sci., California Univ., Berkeley, CA, USA;
This paper appears in: Reliable Distributed Systems, 2002. Proceedings. 21st IEEE Symposium on
Publication Date: 2002
On page(s): 362- 367
ISSN: 1060-9857
ISBN: 0-7695-1659-9
INSPEC Accession Number: 7516799
Digital Object Identifier: 10.1109/RELDIS.2002.1180211
Current Version Published: 2003-02-25
Abstract
Failure independence is an important assumption for many fault tolerance techniques. Unfortunately, real systems exhibit correlated failures. In this paper, we present a framework for online discovery of groups of server nodes that are maximally independent in their failure characteristics. We discuss the framework in detail and provide a preliminary evaluation.
Index
Terms
Available to subscribers and IEEE members.
References
Available to subscribers and IEEE members.
Citing Documents
Available to subscribers and IEEE members.