A learning algorithm for the finite-time two-armed bandit problem | IEEE Journals & Magazine | IEEE Xplore