Search Results for author: John W. Roberts

Found 1 paper, 0 papers with code

Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms

no code implementations • NeurIPS 2008 • John W. Roberts, Russ Tedrake

Policy gradient (PG) reinforcement learning algorithms have strong (local) convergence guarantees, but their learning performance is typically limited by a large variance in the estimate of the gradient.
