Abstract:
The design of a continuous learning controller for quadrotors often entails some specific implementations that require significant system knowledge and are prone to exper...Show MoreMetadata
Abstract:
The design of a continuous learning controller for quadrotors often entails some specific implementations that require significant system knowledge and are prone to experience catastrophic forgetting. To address these challenges, a deterministic approach is trained using a quadrotor on a relatively small amount of automatically generated data. The Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm is utilized to develop the policy for learning the maneuvers of a quadrotor and controlling it alongside the low-level controller. The algorithm outlined demonstrates proficiency in handling large state spaces and actions that are continuous. It integrates clipped double Q-learning, target policy smoothing, and delayed policy updates, all of which contribute to its effectiveness in training. The proposed control technique’s efficacy is evaluated through numerical simulations conducted on a quadrotor in both standard and windy conditions. The results identified that learning with TD3 reduced the overestimation bias, improved the convergence accuracy, and achieved efficient maneuver with less tracking error by using the dense reward structure.
Published in: 2023 27th International Conference on Methods and Models in Automation and Robotics (MMAR)
Date of Conference: 22-25 August 2023
Date Added to IEEE Xplore: 11 September 2023
ISBN Information: