Dynamic Actor-Advisor Programming for Scalable Safe Reinforcement Learning


Abstract:

Real-world robots are subject to complex, strict constraints. Safe reinforcement learning algorithms that simultaneously minimize the total cost and the risk of constraint violation are therefore crucial. However, to the best of our knowledge, almost no existing algorithms scale to high-dimensional systems. In this paper, we propose Dynamic Actor-Advisor Programming (DAAP), a sample-efficient and scalable safe reinforcement learning algorithm. DAAP employs two control policies, an actor and an advisor, which are updated in an intertwined manner to minimize the total cost and the risk of constraint violation, respectively; each is moved smoothly toward the other by using it as the baseline policy in the Kullback-Leibler divergence term of the Dynamic Policy Programming framework. We demonstrate the scalability and sample efficiency of DAAP on simulated robot-arm control tasks, with performance comparisons against baseline methods.
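
As a rough illustration of the coupled update the abstract describes, one natural reading is a pair of DPP-style KL-regularized policy updates in which each policy serves as the other's baseline. This is a sketch under our own assumptions (the symbols \eta, Q^{cost}, Q^{risk} and the exact form are ours, not the paper's equations):

\pi^{actor}_{t+1}(a \mid s) \;\propto\; \pi^{advisor}_{t}(a \mid s)\, \exp\!\big(-\eta\, Q^{cost}_{t}(s,a)\big)

\pi^{advisor}_{t+1}(a \mid s) \;\propto\; \pi^{actor}_{t}(a \mid s)\, \exp\!\big(-\eta\, Q^{risk}_{t}(s,a)\big)

Here \eta is an assumed inverse-temperature (step-size) parameter, and Q^{cost}_{t} and Q^{risk}_{t} are action-value estimates for the total cost and the constraint-violation risk. Each line is the closed-form minimizer of \mathbb{E}_{\pi}[Q] + \tfrac{1}{\eta}\, \mathrm{KL}(\pi \,\|\, \text{baseline}), the standard KL-regularized objective in the Dynamic Policy Programming family, which is what produces the smooth movement of each policy toward the other.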
Date of Conference: 31 May 2020 - 31 August 2020
Date Added to IEEE Xplore: 15 September 2020
Conference Location: Paris, France
