Value learning from trajectory optimization and Sobolev descent: A step toward reinforcement learning with superlinear convergence properties | IEEE Conference Publication