A Reinforcement Learning Sampling Optimization Method Based on Training Value | IEEE Conference Publication | IEEE Xplore