Efficient Bayesian Policy Reuse With a Scalable Observation Model in Deep Reinforcement Learning | IEEE Journals & Magazine | IEEE Xplore