Efficient Dialog Policy Learning With Hindsight, User Modeling, and Adaptation | IEEE Journals & Magazine | IEEE Xplore