By Topic

A Reinforcement Learning Algorithm Based on Minimum State Method and Average Reward

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

4 Author(s)
Fu Qi-ming ; Inst. of Comput. Sci. & Technol., Soochow Univ., Suzhou, China ; Liu Quan ; Cui Zhi-ming ; Fu Yu-chen

Reinforcement learning is learning what to do - how to map situations to actions - so as to maximize a numerical reward signal. In allusion to the problem that Q-Learning, which uses discount reward as the evaluation criterion, cannot show the affect of the action to the next situation, the paper puts forward AR-Q-Learning based on the average reward and Q-Learning. In allusion to the Curse Of Dimensionality, which means that the computational requirements grow exponentially with the number of state variables, the paper puts forward Minimum State Method. AR-Q-Learning and Minimum State Method are used in the reinforcement learning of Blocks World, and the result of the experiment shows that the method has the characteristic of after effect and converge more faster than Q-Learning, and at the same time, solve the Curse Of Dimensionality in Blocks World in a certain extent.

Published in:

Computer Science and Information Engineering, 2009 WRI World Congress on  (Volume:5 )

Date of Conference:

March 31 2009-April 2 2009