Search Results for author: Gautam Chandrasekaran

Found 1 paper, 0 papers with code

Learning in Online MDPs: Is there a Price for Handling the Communicating Case?

no code implementations • 3 Nov 2021 • Gautam Chandrasekaran, Ambuj Tewari

In contrast, it has been shown that handling online MDPs with communicating structure and bandit information incurs $\Omega(T^{2/3})$ regret even in the case of deterministic transitions.
