Reinforcement learning is direct adaptive optimal control | IEEE Journals & Magazine | IEEE Xplore