1 code implementation • 6 Jun 2019 • Patrick Nadeem Ward, Ariella Smofsky, Avishek Joey Bose
Deep Reinforcement Learning (DRL) algorithms for continuous action spaces are known to be brittle toward hyperparameters as well as \cut{being}sample inefficient.