Search Results for author: Thomas Jaksch

Found 1 papers, 0 papers with code

Near-optimal Regret Bounds for Reinforcement Learning

no code implementations NeurIPS 2008 Peter Auer, Thomas Jaksch, Ronald Ortner

For undiscounted reinforcement learning in Markov decision processes (MDPs) we consider the total regret of a learning algorithm with respect to an optimal policy.

reinforcement-learning Reinforcement Learning (RL)

Cannot find the paper you are looking for? You can Submit a new open access paper.