Reinforcement-Learning-Based Multi-Unmanned Aerial Vehicle Optimal Control for Communication Services With Limited Endurance | IEEE Journals & Magazine | IEEE Xplore