Enhancing Exploration With Diffusion Policies in Hybrid Off-Policy RL: Application to Non-Prehensile Manipulation | IEEE Journals & Magazine | IEEE Xplore