A communication-induced checkpointing algorithm using virtual checkpoint on distributed systems

Authors: Kim Do-Hyung (Electron. & Telecommun. Res. Inst., South Korea); Park Chang-Soon

Checkpointing is a fault-tolerance technique for recovering from faults and restarting jobs quickly. Checkpointing algorithms for distributed systems have been studied for years and fall into three classes: coordinated, uncoordinated, and communication-induced. In this paper we propose a new communication-induced checkpointing algorithm that has a minimum checkpoint count equivalent to that of the periodic checkpointing algorithm and a relatively short rollback distance when a fault occurs. The proposed algorithm is compared, by simulation, with previously proposed communication-induced checkpointing algorithms. In the simulation, the proposed algorithm outperforms the other algorithms in task completion time in both fault-free and fault situations.

Published in:

Parallel and Distributed Systems, 2000. Proceedings. Seventh International Conference on

Date of Conference:

2000