Closed-Loop Aggregated Baseline Load Estimation Using Contextual Bandit With Policy Gradient | IEEE Journals & Magazine | IEEE Xplore