Robot Policy Learning from Demonstration Using Advantage Weighting and Early Termination | IEEE Conference Publication | IEEE Xplore