Reliability of checkpointed real-time systems using time redundancy
Krishna, C.M.
Singh, A.D.
Dept. of Electr. & Comput. Eng., Massachusetts Univ., Amherst, MA;
This paper appears in: Reliability, IEEE Transactions on
Publication Date: Sep 1993
Volume: 42,
Issue: 3
On page(s): 427-435
ISSN: 0018-9529
References Cited: 13
CODEN: IERQAD
INSPEC Accession Number: 4592651
Digital Object Identifier: 10.1109/24.257826
Current Version Published: 2002-08-06
Abstract
Real-time computers are often used in embedded, life-critical
applications where high reliability is important. A common approach to
making such systems dependable is to vote on redundant processors
executing multiple copies of the same task is described. The processors
which make up such voted systems are subjected not only to independently
occurring permanent and transient failure, but also to correlated
transients brought about by electromagnetic interference from the
operating environment. To counteract these transients, checkpointing and
time redundancy are required, in addition to processor redundancy. This
work analyzes the use of time and device redundancy in systems subject
to correlated failure. The tradeoffs in checkpoint placement in such a
system are found to be considerably different from those for
non-redundant systems without real-time constraints. The authors compare
fault-tolerant designs and without a rollback capability, accounting for
the increased hardware-failure rate due to processor duplication when
faults are detected in hardware, and the doubled execution times when
detection is implemented in software
Index
Terms
Available to subscribers and IEEE members.
References
Available to subscribers and IEEE members.
Citing Documents
Available to subscribers and IEEE members.