Abstract
A method of execution retry for bypassing software faults based on
checkpointing, rollback, message reordering, and replaying is described.
The authors demonstrate how rollback techniques, previously developed
for transient hardware failure recovery, can also be used to recover
from software errors by exploiting message reordering to bypass software
faults. The approach intentionally increases the degree of
nondeterminism and the scope of rollback when a previous retry fails.
Examples from experience with telecommunications software systems
illustrate the benefits of the scheme
Index
Terms
Available to subscribers and IEEE members.
References
Available to subscribers and IEEE members.
Citing Documents
Available to subscribers and IEEE members.