Online learning control law design based on policy gradient reinforcement learning | IEEE Conference Publication | IEEE Xplore