By Topic

Reliability considerations in large-scale computing systems

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

2 Author(s)
W. Majjar ; Inf. Sci. Inst., Marina del Rey, CA, USA ; J. Gaudiot

The authors address the issue of scalability in the reliability analysis of large-scale degradable homogeneous multiprocessors. The main motivation behind this analysis stems from consideration of whether the reliability of such systems imposes a limit on the number of processors that can cooperate on one problem. Traditional techniques of reliability and performability analysis are used to evaluate the asymptotic behavior of measures such as the mean time to failure and the mission time. The concept of computational reliability is used as a tool to evaluate the measure of reliable computational work as a function of the number of processors. It is shown that the number of processor hours a realistic system can deliver is upper bounded independently from the number of processors. The results demonstrate that graceful degradation in large-scale systems is not scalable; an increase in the number of processors must be matched by a significant increase in the coverage factor in order to maintain the same performance and reliability levels

Published in:

Frontiers of Massively Parallel Computation, 1988. Proceedings., 2nd Symposium on the Frontiers of

Date of Conference:

10-12 Oct 1988