An actor-critic method using Least Squares Temporal Difference learning | IEEE Conference Publication | IEEE Xplore