A reward allocation method for reinforcement learning in stabilizing control of T-inverted pendulum | IEEE Conference Publication | IEEE Xplore