Search Results for author: Itai Shufaro

Found 1 papers, 0 papers with code

On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes

no code implementations11 Mar 2024 Navdeep Kumar, Yashaswini Murthy, Itai Shufaro, Kfir Y. Levy, R. Srikant, Shie Mannor

We present the first finite time global convergence analysis of policy gradient in the context of infinite horizon average reward Markov decision processes (MDPs).

Cannot find the paper you are looking for? You can Submit a new open access paper.