New value iteration and Q-learning methods for the average cost dynamic programming problem | IEEE Conference Publication | IEEE Xplore