Utilizing Learning Automata and Entropy to Improve the Exploration Power of Rescue Agents

3 Author(s)
Masoumi, B. ; Dept. of Comput. Eng., Islamic Azad Univ., Qazvin, Iran ; Asghari, M. ; Meybodi, M.R.

The Rescue Simulation System is an example of a multi-agent system that poses many challenges. One of these is the tradeoff between exploration and exploitation in the path-planning phase. In this paper we present an exploration method based on a variable-structure S-model learning automaton, which uses the entropy of the action probability vector as the criterion for rewarding or penalizing its selected action. This method can also lead agents to establish a reasonable balance between exploration and exploitation. The results show that the proposed method performs well in terms of both exploration and the final score achieved in the rescue simulation system.
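
To make the two ingredients named in the abstract concrete, the following is a minimal sketch of a textbook S-model linear reward-penalty update for a variable-structure learning automaton, together with the normalized Shannon entropy of its action probability vector. It is not the authors' implementation: the abstract does not say how entropy is mapped to a reward or penalty, so the function names, the learning rates `a` and `b`, and the use of a response `beta` in [0, 1] (0 = fully favorable, 1 = fully unfavorable) are all assumptions made for illustration.

```python
import math


def normalized_entropy(p):
    """Shannon entropy of probability vector p, scaled to [0, 1].

    1.0 means the automaton is still choosing uniformly (exploring);
    0.0 means it has converged on a single action.
    """
    r = len(p)
    if r < 2:
        return 0.0
    h = -sum(x * math.log(x) for x in p if x > 0.0)
    return h / math.log(r)


def s_model_update(p, chosen, beta, a=0.1, b=0.1):
    """One S-model linear reward-penalty step (hypothetical helper).

    p      : current action probability vector (sums to 1)
    chosen : index of the action that was just performed
    beta   : environment response in [0, 1]; 0 is favorable, 1 is unfavorable
    a, b   : reward and penalty learning rates (assumed values)
    """
    r = len(p)
    updated = []
    for j, pj in enumerate(p):
        if j == chosen:
            # Reward pulls the chosen action toward 1, penalty shrinks it.
            new_pj = pj + a * (1.0 - beta) * (1.0 - pj) - b * beta * pj
        else:
            # Other actions lose mass on reward and regain it on penalty.
            new_pj = pj - a * (1.0 - beta) * pj + b * beta * (1.0 / (r - 1) - pj)
        updated.append(new_pj)
    return updated


if __name__ == "__main__":
    probs = [0.25, 0.25, 0.25, 0.25]  # e.g. four candidate paths
    print(normalized_entropy(probs))
    print(s_model_update(probs, chosen=0, beta=0.0))
```

An entropy-based scheme along the lines the abstract describes would derive `beta` from `normalized_entropy(p)` before calling `s_model_update`; the exact mapping is left open here because the abstract does not specify it.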

Published in:

Intelligent Systems (GCIS), 2010 Second WRI Global Congress on (Volume: 1)

Date of Conference:

16-17 Dec. 2010