Progressive retry for software error recovery in distributedsystems
Wang, Y.-M.; Huang, Y.; Fuchs, W.K.
Fault-Tolerant Computing, 1993. FTCS-23. Digest of Papers., The Twenty-Third International Symposium on
Volume , Issue , 22-24 Jun 1993 Page(s):138 - 144
Digital Object Identifier 10.1109/FTCS.1993.627317
Summary:A method of execution retry for bypassing software faults based on
checkpointing, rollback, message reordering, and replaying is described.
The authors demonstrate how rollback techniques, previously developed
for transient hardware failure recovery, can also be used to recover
from software errors by exploiting message reordering to bypass software
faults. The approach intentionally increases the degree of
nondeterminism and the scope of rollback when a previous retry fails.
Examples from experience with telecommunications software systems
illustrate the benefits of the scheme
View citation and abstract |