no code implementations • 6 Jun 2019 • Matthew Rahtz, James Fang, Anca D. Dragan, Dylan Hadfield-Menell
In deep reinforcement learning, for example, directly specifying a reward as a function of a high-dimensional observation is challenging.
Reinforcement Learning (RL)