OPIRL: Sample Efficient Off-Policy Inverse Reinforcement Learning via Distribution Matching | IEEE Conference Publication | IEEE Xplore