Search Results for author: John W. Roberts

Found 1 papers, 0 papers with code

Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms

no code implementations NeurIPS 2008 John W. Roberts, Russ Tedrake

Policy gradient (PG) reinforcement learning algorithms have strong (local) convergence guarantees, but their learning performance is typically limited by a large variance in the estimate of the gradient.

Cannot find the paper you are looking for? You can Submit a new open access paper.