An analysis of temporal-difference learning with function approximation | IEEE Journals & Magazine | IEEE Xplore