An Improved Exploration Method for Cooperative Multi-UAV Policy Learning with Sparse Rewards | IEEE Conference Publication | IEEE Xplore