Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays-Part I: I.I.D. rewards | IEEE Journals & Magazine | IEEE Xplore