1 code implementation • ICLR 2020 • Pan Xu, Felicia Gao, Quanquan Gu
Improving the sample efficiency in reinforcement learning has been a long-standing research problem.
no code implementations • 29 May 2019 • Pan Xu, Felicia Gao, Quanquan Gu
We revisit the stochastic variance-reduced policy gradient (SVRPG) method proposed by Papini et al. (2018) for reinforcement learning.