Stochastic kernel temporal difference for reinforcement learning | IEEE Conference Publication | IEEE Xplore