Sample efficient transfer in reinforcement learning for high variable cost environments with an inaccurate source reward model | IEEE Conference Publication | IEEE Xplore