Upper Confidence Interval Strategies for Multi-Armed Bandits with Entropy Rewards | IEEE Conference Publication | IEEE Xplore