On-Policy Reinforcement Learning via Ensemble Gaussian Processes with Application to Resource Allocation | IEEE Conference Publication | IEEE Xplore