Logarithmic Online Regret Bounds for Undiscounted Reinforcement Learning

Logarithmic Online Regret Bounds for Undiscounted Reinforcement Learning | part of Advances in Neural Information Processing Systems 19: Proceedings of the 2006 Conference | MIT Press books | IEEE Xplore

IEEE Account

Purchase Details

Profile Information

Need Help?