By Topic

Probabilistic Fault-tolerance of Distributed Services: A Paradigm for Dependable Applications

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

1 Author(s)
Ravindran, K. ; Dept. of Comput. Sci., City Univ. of New York, New York, NY, USA

The paper deals with a study of probabilistic methods to manage the dependability of a networked distributed system, in the presence of inaccuracies and partial knowledge of system models pertaining to failures. A distributed networked system (DIS) that collects raw data from sensors deployed in the field and delivers a composite data to an end-user is faced with two types of uncertainties: at 'information level' due to the multi-modal nature of raw data collected from the environment, and at 'control level' due to the incompleteness in knowledge about the application model. These have a compounded effect on the quality of fault-tolerance exhibited by a DIS. Based on service-layer abstractions, the paper identifies application-oriented metrics to quantify the quality of information flowing through a DIS. Even with imperfect information, the paper demonstrates how the high-level quality metrics and control algorithms enable achieving a reasonable degree of fault-tolerance in a probabilistic manner. A case study of replicated web services is also described.

Published in:

Availability, Reliability and Security (ARES), 2011 Sixth International Conference on

Date of Conference:

22-26 Aug. 2011