no code implementations • 18 May 2019 • Aleksandra Faust, Anthony Francis, Dar Mehta
Many continuous control tasks have easily formulated objectives, yet using them directly as a reward in reinforcement learning (RL) leads to suboptimal policies.
Continuous Control Hyperparameter Optimization +2