Search Results for author: James Simon

Found 1 papers, 0 papers with code

Critical Point-Finding Methods Reveal Gradient-Flat Regions of Deep Network Losses

no code implementations • 23 Mar 2020 • Charles G. Frye, James Simon, Neha S. Wadia, Andrew Ligeralde, Michael R. DeWeese, Kristofer E. Bouchard

Despite the fact that the loss functions of deep neural networks are highly non-convex, gradient-based optimization algorithms converge to approximately the same performance from many random initial points.

Second-order methods

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.