6 Sep 2021 • Ning Wei, Jiahua Liang, Di Xie, ShiLiang Pu
Designing optimal reward functions is a long-standing goal in reinforcement learning (RL), but it remains extremely difficult.