Real-Time Distributed Discrete-Event Execution with Fault Tolerance
Feng, T.H.
Lee, E.A.
Dept. of Electr. Eng. & Comput. Sci., California Univ., Berkeley, CA;
This paper appears in: Real-Time and Embedded Technology and Applications Symposium, 2008. RTAS '08. IEEE
Publication Date: 22-24 April 2008
On page(s): 205-214
Location: St. Louis, MO,
ISSN: 1080-1812
ISBN: 978-0-7695-3146-5
INSPEC Accession Number: 10060308
Digital Object Identifier: 10.1109/RTAS.2008.22
Current Version Published: 2008-06-27
Abstract
We build on PTIDES, a programming model for distributed embedded systems that uses discrete-event (DE) models as program specifications. PTIDES improves on distributed DE execution by allowing more concurrent event processing without backtracking. This paper discusses the general execution strategy for PTIDES, and provides two feasible implementations. This execution strategy is then extended with tolerance for hardware errors. We take a program transformation approach to automatically enhance DE models with incremental checkpointing and state recovery functionality. Our fault tolerance mechanism is lightweight and has low overhead. It requires very little human intervention. We incorporate this mechanism into PTIDES for efficient execution of fault- tolerant real-time distributed DE systems.
Index
Terms
Available to subscribers and IEEE members.
References
Available to subscribers and IEEE members.
Citing Documents
Available to subscribers and IEEE members.