Relative Q-learning for Average-Reward Markov Decision Processes with Continuous States | IEEE Journals & Magazine | IEEE Xplore