By Topic

Abstraction of state-action space utilizing properties of the body and the environmentL — Application to a 3-dimensional snake-like robot that operates on rubble

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

3 Author(s)
Ito, K. ; HOSEI Univ., Tokyo, Japan ; Kuroe, S. ; Kobayashi, T.

In this paper, we address the autonomous control of a 3-dimensional snake-like robot by using reinforcement learning, and we apply it in the case of rubble. In general, snake-like robots have high mobility that is realized by many degrees of freedom, and they can move on rubble. However, the many degrees of freedom cause the state explosion problem, and the complexity of the rubble results in incomplete learning. Therefore, it is impossible to apply reinforcement learning to conventional snake-like robots that move on rubble. In this paper, to solve these problems, we focus on properties of the real environment and the dynamics of a mechanical body. We design the body of the robot for abstracting the necessary small state-action space by considering real-world properties, and we make it possible to apply reinforcement learning. To demonstrate the effectiveness of the proposed snake-like robot, we conducted experiments where learning was completed within reasonable time and the robot effectively adapted itself to an unknown 3-dimensional environment.

Published in:

Intelligent Systems (IS), 2012 6th IEEE International Conference

Date of Conference:

6-8 Sept. 2012