Search Results for author: Jinghan Wang

Found 1 papers, 0 papers with code

Near Sample-Optimal Reduction-based Policy Learning for Average Reward MDP

no code implementations1 Dec 2022 Jinghan Wang, Mengdi Wang, Lin F. Yang

This work considers the sample complexity of obtaining an $\varepsilon$-optimal policy in an average reward Markov Decision Process (AMDP), given access to a generative model (simulator).

Cannot find the paper you are looking for? You can Submit a new open access paper.