Abstract:
Applying value function based reinforcement learning algorithms to real robots has been infeasible because approximating a high-dimensional value function is difficult. The difficulty of such high-dimensional value function approximation in previous methods is twofold: 1) instability of the value function approximation due to non-smooth policy updates and 2) computational complexity associated with the high-dimensional state-action space. To cope with these issues, in this paper we propose Kernel Dynamic Policy Programming (KDPP), which smoothly updates the value function in an implicit high-dimensional feature space. The smooth policy update is promoted by adding the Kullback-Leibler divergence between the current and updated policies to the reward function as a regularization term, which stabilizes the value function approximation. The computational complexity is reduced by applying the kernel trick in the value function approximation. KDPP can therefore be interpreted as a novel yet practical extension of Dynamic Policy Programming (DPP) and kernelized value function-based reinforcement learning methods that combines their strengths. We successfully applied KDPP to learn bottle-cap unscrewing on a humanoid robot hand driven by pneumatic artificial muscles (PAMs), a system with a 24-dimensional state space, using a limited number of samples and commonplace computational resources.
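
The two ingredients named in the abstract can be illustrated with a minimal sketch that is not the authors' implementation: a tabular DPP-style action-preference update, whose softmax policy keeps successive policies close in Kullback-Leibler divergence, and a Gaussian kernel function standing in for the kernel trick used to avoid explicit high-dimensional features. All names and parameter values (eta, gamma, bandwidth, the toy problem size) are illustrative assumptions, not taken from the paper.

# Minimal sketch under the assumptions stated above; not the paper's KDPP code.
import numpy as np

eta, gamma = 2.0, 0.95          # inverse temperature, discount factor (assumed values)

def softmax_policy(psi_s):
    """Boltzmann policy over action preferences psi(s, .)."""
    z = np.exp(eta * (psi_s - psi_s.max()))
    return z / z.sum()

def boltzmann_backup(psi_s):
    """Softmax-weighted value of the preferences, used in the DPP-style update."""
    return softmax_policy(psi_s) @ psi_s

def dpp_style_update(psi, s, a, r, s_next):
    """Tabular preference update; the softmax keeps successive policies close
    in KL divergence, which is what promotes the smooth policy update."""
    psi[s, a] += r + gamma * boltzmann_backup(psi[s_next]) - boltzmann_backup(psi[s])

def gaussian_kernel(x, centers, bandwidth=1.0):
    """Kernel trick idea: work with kernel values k(x, x_i) against a set of
    centers instead of an explicit high-dimensional feature map."""
    d2 = ((x[None, :] - centers) ** 2).sum(axis=1)
    return np.exp(-d2 / (2.0 * bandwidth ** 2))

# Toy usage on a hypothetical 3-state, 2-action problem.
psi = np.zeros((3, 2))
dpp_style_update(psi, s=0, a=1, r=1.0, s_next=2)
print(softmax_policy(psi[0]))

In the actual method the preferences are represented in the kernel-induced feature space rather than a table; the sketch only shows why the softmax update stabilizes the approximation and how a kernel replaces explicit features.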
Date of Conference: 15-17 November 2016
Date Added to IEEE Xplore: 02 January 2017
Electronic ISSN: 2164-0580