no code implementations • 1 Jul 2020 • Shuai Han, Wenbo Zhou, Shuai Lü, Jiayu Yu
Deep Deterministic Policy Gradient (DDPG) algorithm is one of the most well-known reinforcement learning methods.
MuJoCo Q-Learning