Search Results for author: Alexander Golubev

Found 1 paper, 0 papers with code

Variance Reduction for Policy-Gradient Methods via Empirical Variance Minimization

no code implementations • 14 Jun 2022 • Maxim Kaledin, Alexander Golubev, Denis Belomestny

Policy-gradient methods in Reinforcement Learning (RL) are very universal and widely applied in practice, but their performance suffers from the high variance of the gradient estimate.

Policy Gradient Methods, Reinforcement Learning (RL)
