Learning control of finite Markov chains with unknown transition probabilities | IEEE Journals & Magazine | IEEE Xplore