Reinforcement Learning for Compensating Power Excursions in Amplified WDM Systems | IEEE Journals & Magazine | IEEE Xplore