By Topic

Failure Prediction Mechanisms in Cluster Systems

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

2 Author(s)
Sharifi, M. ; Comput. Eng. Dept., Iran Univ. of Sci. & Technol., Tehran ; Hamedi, S.A.

Clustering is an important technique for improving the performance and availability of computer systems. The use of cluster systems is also continuously growing because they present excellent features like scalability, high availability and high performance computing. Availability is mainly administered by failure detection and recovery mechanism, including proactive failure mechanisms that try to prevent occurrences of faults. Given the criticality and importance of availability for high performance computing, this paper uniquely surveyes noticeable existing mechanisms for prevention of faults in high availability and high performance computing cluster systems, and presents a comparative overview.

Published in:

Biocomputation, Bioinformatics, and Biomedical Technologies, 2008. BIOTECHNO '08. International Conference on

Date of Conference:

June 29 2008-July 5 2008