Least squares temporal difference actor-critic methods with applications to robot motion control | IEEE Conference Publication | IEEE Xplore