Deep Reinforcement Learning for Dynamic Bandwidth Allocation in Multi-Beam Satellite Systems | IEEE Conference Publication | IEEE Xplore