Off-policy reinforcement learning with Gaussian processes | IEEE Journals & Magazine | IEEE Xplore