Reinforcement learning algorithms for semi-Markov decision processes with average reward | IEEE Conference Publication