no code implementations • 3 Nov 2020 • Gavin McCracken, Colin Daniels, Rosie Zhao, Anna Brandenberger, Prakash Panangaden, Doina Precup
Policy gradient methods are extensively used in reinforcement learning as a way to optimize expected return.
Policy Gradient Methods