Robot behavior adaptation for human-robot interaction based on policy gradient reinforcement learning | IEEE Conference Publication | IEEE Xplore