Multi-Agent Reinforcement Learning With Spatial–Temporal Attention for Flocking With Collision Avoidance of a Scalable Fixed-Wing UAV Fleet