Inferring Cost Functions Using Reward Parameter Search and Policy Gradient Reinforcement Learning | IEEE Conference Publication | IEEE Xplore