Expert-based reward shaping and exploration scheme for boosting policy learning of dialogue management | IEEE Conference Publication | IEEE Xplore