no code implementations • 3 Nov 2020 • Gavin McCracken, Colin Daniels, Rosie Zhao, Anna Brandenberger, Prakash Panangaden, Doina Precup
Policy gradient methods are extensively used in reinforcement learning as a way to optimize expected return.
no code implementations • 20 Feb 2020 • David Venuto, Jhelum Chakravorty, Leonard Boussioux, Junhao Wang, Gavin McCracken, Doina Precup
Explicit engineering of reward functions for given environments has been a major hindrance to reinforcement learning methods.