Loading web-font TeX/Caligraphic/Regular
Model Learning Predictive Control in Nonlinear Dynamical Systems | IEEE Conference Publication | IEEE Xplore

Model Learning Predictive Control in Nonlinear Dynamical Systems


Abstract:

We study the problem of online learning and control in partially observable nonlinear dynamical systems, where the model dynamics are unknown and the controlling agent ha...Show More

Abstract:

We study the problem of online learning and control in partially observable nonlinear dynamical systems, where the model dynamics are unknown and the controlling agent has only access to the system outputs. We propose Model Learning Predictive Control (MLPC), an efficient online control framework that learns to control the unknown system and minimizes the overall control cost. MLPC employs Random Fourier Features (RFF) to represent the nonlinear system dynamics and learns the underlying system up to a confidence interval. Once a reliable estimate of the dynamics is obtained, MLPC deploys an MPC oracle with the estimated system dynamics for planning. MLPC occasionally updates the underlying model estimates and improves the accuracy and the effectiveness of the MPC policies. We derive a novel finite-time approximation error bound under RFF learning and provide stability guarantees for single trajectory online control. We show that MLPC attains \tilde {\mathcal{O}}\left( {{T^{2/3}}} \right) regret after T time steps in online control of stable partially observable nonlinear systems against the controller that uses the same MPC oracle with the true system dynamics. We empirically demonstrate the performance of MLPC on the inverted pendulum task and show the flexibility of the proposed general framework via deploying different planning strategies for the controller design to achieve low-cost control policies.
Date of Conference: 14-17 December 2021
Date Added to IEEE Xplore: 01 February 2022
ISBN Information:

ISSN Information:

Conference Location: Austin, TX, USA

Contact IEEE to Subscribe

References

References is not available for this document.