By Topic

A distributed deterministic help scheme to improve the system fault tolerance

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

2 Author(s)
Mirian, M.S. ; Control and Intelligent Processing Center of Excellence and AI and Robotics Lab, Dept. of ECE, University of Tehran, Iran ; Ahmadabadi, M.N.

This paper proposes some methods to improve the fault-tolerance in distributed systems specifically in deterministic situations by distributed decision-making and coordination. Providing a distributed system with fault tolerance is a feasible but hard problem due to the intrinsic aspects of such systems such as: independency, unpredictability and communication problem. A multi-agent system as an instance of distributed system, can handle different kinds of fault by using traditional fault tolerance techniques. But what focused in this paper are agent-based techniques. In fact proposed methods are based on agent-based help provision by distributed cooperation among helper agents. The helpers try to tune their normal roles such that they can undertake the faulty agents' tasks too. These helpers go through a sub-optimal task selection algorithm, to decide whom to help. It is important to remark that there is no explicit interaction; instead they coordinate their decisions implicitly by adopting the most appropriate task in terms of their speed, relative reliability and the task's criticality coefficient. Proposed ideas are tested on a DCS-like tested to improve its fault tolerance. The results illustrate the effectiveness of the approaches in comparison to the case of no help situation and the case of using purely redundant components.

Published in:

Automation Congress, 2004. Proceedings. World  (Volume:16 )

Date of Conference:

June 28 2004-July 1 2004