Search Results for author: Loren Amdahl-Culleton

Found 1 papers, 0 papers with code

All-Action Policy Gradient Methods: A Numerical Integration Approach

no code implementations • 21 Oct 2019 • Benjamin Petit, Loren Amdahl-Culleton, Yao Liu, Jimmy Smith, Pierre-Luc Bacon

While often stated as an instance of the likelihood ratio trick [Rubinstein, 1989], the original policy gradient theorem [Sutton, 1999] involves an integral over the action space.

Continuous Control Numerical Integration +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.