Search Results for author: Bekzhan Kerimkulov

Found 2 papers, 0 papers with code

A Fisher-Rao gradient flow for entropy-regularised Markov decision processes in Polish spaces

no code implementations • 4 Oct 2023 • Bekzhan Kerimkulov, James-Michael Leahy, David Šiška, Lukasz Szpruch, Yufei Zhang

We study the global convergence of a Fisher-Rao policy gradient flow for infinite-horizon entropy-regularised Markov decision processes with Polish state and action spaces.



Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime

no code implementations • 18 Jan 2022 • Bekzhan Kerimkulov, James-Michael Leahy, David Šiška, Lukasz Szpruch

We show that the objective function is increasing along the gradient flow.

