Optimization of Resource Allocation in Multi-Link P2P Communication Systems Based on Reinforcement Learning | IEEE Conference Publication | IEEE Xplore