A hybrid architecture for function approximation

1 Author(s)
Osman, H.E. ; Osman Comput. Intell. & Syst. Sci., Tokyo Inst. of Technol., Tokyo

This paper proposes a new architecture for hybrid value function estimation that combines temporal-difference (TD) learning with an on-line variant of random forest (RF). We call this implementation random-TD. First, the RF is induced in on-line mode to cope with large state spaces and memory constraints, while the state-action mapping is based on the Bellman error, or on the TD error. The approach iteratively improves its value function by exploiting only the relevant parts of the action space. We evaluate the potential of the proposed procedure in terms of the reduction in Bellman error, with extended empirical studies on high-dimensional control problems (Ailerons, Elevator, Kinematics, and Friedman). The results demonstrate that our approach can significantly improve the performance of TD methods and speed up the learning process.
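For readers unfamiliar with the TD error mentioned in the abstract, the following is a minimal sketch of a one-step TD(0) update. It is not the paper's random-TD method: as a simplifying assumption, a plain dictionary stands in for the on-line random forest, and the names `td_error`, `td0_update`, `alpha`, and `gamma` are illustrative choices, not taken from the paper.

```python
def td_error(reward, value_s, value_next, gamma=0.9):
    """One-step TD error: delta = r + gamma * V(s') - V(s)."""
    return reward + gamma * value_next - value_s


def td0_update(values, state, reward, next_state, alpha=0.1, gamma=0.9):
    """Apply one TD(0) update to a dict-based value table and return delta.

    Assumption: a dict replaces the function approximator; unseen states
    default to a value of 0.0.
    """
    v_s = values.get(state, 0.0)
    v_next = values.get(next_state, 0.0)
    delta = td_error(reward, v_s, v_next, gamma)
    # Move the estimate for `state` a fraction alpha toward the TD target.
    values[state] = v_s + alpha * delta
    return delta


# Hypothetical transition: from state "s0" to "s1" with reward 1.0.
values = {}
delta = td0_update(values, state="s0", reward=1.0, next_state="s1")
```

In a function-approximation setting such as the one the abstract describes, the table lookup would be replaced by a learned regressor's prediction, and the TD error would drive that regressor's update instead of a direct table write.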

Published in:

Industrial Informatics, 2008. INDIN 2008. 6th IEEE International Conference on

Date of Conference:

13-16 July 2008