Trajectory Based Prioritized Double Experience Buffer for Sample-Efficient Policy Optimization | IEEE Journals & Magazine | IEEE Xplore