1 code implementation • 5 May 2021 • D. Belomestny, I. Levin, E. Moulines, A. Naumov, S. Samsonov, V. Zorina
Policy evaluation is an important instrument for the comparison of different algorithms in Reinforcement Learning (RL).
1 code implementation • 8 Oct 2019 • D. Belomestny, L. Iosipoi, E. Moulines, A. Naumov, S. Samsonov
In this paper we propose a novel variance reduction approach for additive functionals of Markov chains based on minimization of an estimate for the asymptotic variance of these functionals over suitable classes of control variates.
no code implementations • 18 Mar 2019 • D. Belomestny, E. Moulines, S. Samsonov
In this paper we propose an efficient variance reduction approach for additive functionals of Markov chains relying on a novel discrete time martingale representation.
no code implementations • 13 Dec 2017 • D. Belomestny, L. Iosipoi, Q. Paris, N. Zhivotovskiy
We study the problem of empirical minimization for variance-type functionals over functional classes.