High-Reliability Multi-Agent Q-Learning-Based Scheduling for D2D Microgrid Communications | IEEE Journals & Magazine | IEEE Xplore