Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays-Part II: Markovian rewards
