Fault tolerance is important in real-time systems, where the correct execution of tasks must satisfy certain temporal constraints. Since transient faults occur more frequently than permanent faults, this paper focuses on a transient fault tolerance algorithm, FTRMS. The FORTS group at the University of Pittsburgh derived the FTRMS algorithm and implemented it on FT-RT-Mach. The same mechanism was then transferred to a commercial system, DEOS, at the Honeywell Technology Center. By describing and contrasting the implementations of FTRMS on the two systems, the paper illustrates the differences between an academic system and an industrial system. An application example shows the effect of the new transient fault tolerance scheme on the system.
Published in: Proceedings of the Fifth IEEE Real-Time Technology and Applications Symposium, 1999
Date of Conference: 1999