Adaptive Optimal Control via Q-Learning for Multi-Agent Pursuit-Evasion Games