Towards High Level Skill Learning: Learn to Return Table Tennis Ball Using Monte-Carlo Based Policy Gradient Method | IEEE Conference Publication | IEEE Xplore