no code implementations • 27 Mar 2018 • Ashley D. Edwards, Laura Downs, James C. Davidson
If we relax this one restriction and endow the agent with knowledge of the reward function, and in particular of the goal, we can leverage backwards induction to accelerate training.