Abstract:
This paper introduces Locally Weighted Least Squares Policy Iteration for learning approximate optimal control in settings where models of the dynamics and cost function ...Show MoreMetadata
Abstract:
This paper introduces Locally Weighted Least Squares Policy Iteration for learning approximate optimal control in settings where models of the dynamics and cost function are either unavailable or hard to obtain. Building on recent advances in Least Squares Temporal Difference Learning, the proposed approach is able to learn from data collected from interactions with a system, in order to build a global control policy based on localised models of the state-action value function. Evaluations are reported characterising learning performance for non-linear control problems including an under-powered pendulum swing-up task, and a robotic door-opening problem under different dynamical conditions.
Date of Conference: 03-07 November 2013
Date Added to IEEE Xplore: 02 January 2014
ISBN Information:
ISSN Information:
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Least-squares ,
- Weighted Least Squares ,
- Local Weights ,
- Model-free Learning ,
- Dynamic Model ,
- Dynamical ,
- Value Function ,
- Cost Function ,
- Optimal Control ,
- Control Problem ,
- Nonlinear Problem ,
- Temporal Differences ,
- Control Setting ,
- Nonlinear Control ,
- Nonlinear Control Problem ,
- Temporal Difference Learning ,
- Global Model ,
- Angular Velocity ,
- Weight Function ,
- Radial Basis Function ,
- Policy Improvement ,
- Unknown Dynamics ,
- Unstructured Environments ,
- Policy Evaluation ,
- Step Rate ,
- Angular Position ,
- Viscous Friction ,
- Use Of Learning Techniques ,
- Number Of Basis Functions ,
- Localization Techniques
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Least-squares ,
- Weighted Least Squares ,
- Local Weights ,
- Model-free Learning ,
- Dynamic Model ,
- Dynamical ,
- Value Function ,
- Cost Function ,
- Optimal Control ,
- Control Problem ,
- Nonlinear Problem ,
- Temporal Differences ,
- Control Setting ,
- Nonlinear Control ,
- Nonlinear Control Problem ,
- Temporal Difference Learning ,
- Global Model ,
- Angular Velocity ,
- Weight Function ,
- Radial Basis Function ,
- Policy Improvement ,
- Unknown Dynamics ,
- Unstructured Environments ,
- Policy Evaluation ,
- Step Rate ,
- Angular Position ,
- Viscous Friction ,
- Use Of Learning Techniques ,
- Number Of Basis Functions ,
- Localization Techniques