Residual Policy Optimization with Trust Region Constraints: A Learning Framework for Stable and Agile Wheel-Legged Locomotion

Residual Policy Optimization with Trust Region Constraints: A Learning Framework for Stable and Agile Wheel-Legged Locomotion | IEEE Journals & Magazine | IEEE Xplore

IEEE Account

Purchase Details

Profile Information

Need Help?