Reinforcement learning in continuous time: advantage updating | IEEE Conference Publication | IEEE Xplore