Abstract:
Failure independence is an important assumption for many fault tolerance techniques. Unfortunately, real systems exhibit correlated failures. In this paper, we present a ...Show MoreMetadata
Abstract:
Failure independence is an important assumption for many fault tolerance techniques. Unfortunately, real systems exhibit correlated failures. In this paper, we present a framework for online discovery of groups of server nodes that are maximally independent in their failure characteristics. We discuss the framework in detail and provide a preliminary evaluation.
Date of Conference: 13-16 October 2002
Date Added to IEEE Xplore: 25 February 2003
Print ISBN:0-7695-1659-9
Print ISSN: 1060-9857