no code implementations • 25 Jun 2019 • Long Yang, Yu Zhang, Gang Zheng, Qian Zheng, Pengfei Li, Jianhang Huang, Jun Wen, Gang Pan
Improving sample efficiency has been a longstanding goal in reinforcement learning.
Continuous Control Policy Gradient Methods +2