2 Jun 2022 • Kiwon Lee, Andrew N. Cheng, Courtney Paquette, Elliot Paquette
We analyze the dynamics of large-batch stochastic gradient descent with momentum (SGD+M) on the least squares problem when both the number of samples and the number of dimensions are large.
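For orientation, the setting can be illustrated with a minimal sketch of mini-batch SGD with momentum on a least squares objective. This is not the authors' code, and several choices are assumptions made only for illustration: the heavy-ball form of the momentum update, the 1/(2n) scaling of the objective, Gaussian data with noiseless targets, sampling each batch without replacement, and the hypothetical function names and hyperparameter values.

```python
import random


def least_squares_loss(A, b, x):
    """Objective f(x) = (1 / 2n) * ||A x - b||^2 (scaling is an assumption)."""
    n = len(A)
    total = 0.0
    for row, target in zip(A, b):
        residual = sum(a * xj for a, xj in zip(row, x)) - target
        total += residual * residual
    return total / (2.0 * n)


def sgd_momentum_least_squares(A, b, batch_size, step_size, momentum,
                               n_iters, seed=0):
    """Mini-batch SGD with heavy-ball momentum, started from x = 0."""
    rng = random.Random(seed)
    n, d = len(A), len(A[0])
    x = [0.0] * d
    v = [0.0] * d  # momentum (velocity) buffer
    for _ in range(n_iters):
        # Draw a batch of distinct row indices (assumed sampling scheme).
        batch = rng.sample(range(n), batch_size)
        # Average the per-sample gradients (a_i . x - b_i) * a_i over the batch.
        grad = [0.0] * d
        for i in batch:
            residual = sum(A[i][j] * x[j] for j in range(d)) - b[i]
            for j in range(d):
                grad[j] += residual * A[i][j] / batch_size
        # Heavy-ball update: v <- momentum * v - step_size * grad, x <- x + v.
        for j in range(d):
            v[j] = momentum * v[j] - step_size * grad[j]
            x[j] += v[j]
    return x


def make_problem(n, d, seed=1):
    """Hypothetical synthetic instance: Gaussian A and noiseless b = A x_true."""
    rng = random.Random(seed)
    x_true = [rng.gauss(0.0, 1.0) for _ in range(d)]
    A = [[rng.gauss(0.0, 1.0) for _ in range(d)] for _ in range(n)]
    b = [sum(a * xt for a, xt in zip(row, x_true)) for row in A]
    return A, b, x_true


if __name__ == "__main__":
    A, b, _ = make_problem(n=200, d=20)
    x = sgd_momentum_least_squares(A, b, batch_size=50, step_size=0.1,
                                   momentum=0.5, n_iters=500)
    print("initial loss:", least_squares_loss(A, b, [0.0] * 20))
    print("final loss:  ", least_squares_loss(A, b, x))
```

The sketch uses small n and d so it runs quickly in pure Python; the regime described in the abstract has both quantities large, which this toy instance does not attempt to reproduce.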