Reinforcement learning-based dynamic bandwidth provisioning for quality of service in differentiated services networks | IEEE Conference Publication | IEEE Xplore