Search Results for author: Brett W. Larsen

Found 6 papers, 4 papers with code

How many degrees of freedom do we need to train deep networks: a loss landscape perspective

1 code implementation • ICLR 2022 • Brett W. Larsen, Stanislav Fort, Nic Becker, Surya Ganguli

In particular, we show via Gordon's escape theorem that the training dimension plus the Gaussian width of the desired loss sub-level set, projected onto a unit sphere surrounding the initialization, must exceed the total number of parameters for the success probability to be large.
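A rough way to write down this condition (the notation below is my own shorthand, not the paper's exact statement): with $D$ the total number of parameters, $d$ the training dimension, $S$ the desired loss sub-level set projected onto the unit sphere around the initialization, and $w(\cdot)$ the Gaussian width (squared here, following the usual convention in Gordon-type bounds), training succeeds with high probability only when roughly

\[
d \;+\; w\!\left(\mathrm{proj}_{\mathbb{S}^{D-1}}(S)\right)^{2} \;\gtrsim\; D .
\]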

Lottery Tickets on a Data Diet: Finding Initializations with Sparse Trainable Networks

1 code implementation • 2 Jun 2022 • Mansheej Paul, Brett W. Larsen, Surya Ganguli, Jonathan Frankle, Gintare Karolina Dziugaite

A striking observation about iterative magnitude pruning (IMP; Frankle et al. 2020) is that, after just a few hundred steps of dense training, the method can find a sparse sub-network that can be trained to the same accuracy as the dense network.
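For context, here is a minimal sketch of the IMP-with-rewinding loop that observation refers to. It is an illustration, not the authors' implementation: the callable `train_fn`, the helper names, and the hyperparameters are assumptions.

```python
import copy
import torch

def imp_with_rewinding(model, train_fn, rewind_step, total_steps,
                       prune_fraction=0.2, rounds=10):
    """Sketch of iterative magnitude pruning (IMP) with weight rewinding.

    `train_fn(model, masks, num_steps)` is assumed to train `model` in place
    while keeping weights whose mask entry is 0 frozen at zero.
    """
    # Train densely for a few (hundred) steps and save the rewind point.
    masks = {name: torch.ones_like(p) for name, p in model.named_parameters()}
    train_fn(model, masks, rewind_step)
    rewind_state = copy.deepcopy(model.state_dict())

    for _ in range(rounds):
        # Train to completion under the current sparsity mask.
        train_fn(model, masks, total_steps - rewind_step)

        with torch.no_grad():
            # Globally prune the smallest-magnitude surviving weights.
            surviving = torch.cat([
                p.abs().flatten()[masks[name].flatten() > 0]
                for name, p in model.named_parameters()
            ])
            k = int(prune_fraction * surviving.numel())
            if k > 0:
                threshold = surviving.kthvalue(k).values
                for name, p in model.named_parameters():
                    masks[name] = ((p.abs() > threshold)
                                   & (masks[name] > 0)).float()

        # Rewind surviving weights to their values at the rewind step.
        model.load_state_dict(rewind_state)

    return masks
```

The observation quoted above concerns the rewind point: a few hundred steps of dense training already yield an initialization from which the sparse sub-network can be trained to match the dense network's accuracy.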

Estimating Shape Distances on Neural Representations with Limited Samples

1 code implementation • 9 Oct 2023 • Dean A. Pospisil, Brett W. Larsen, Sarah E. Harvey, Alex H. Williams

Measuring geometric similarity between high-dimensional network representations is a topic of longstanding interest to neuroscience and deep learning.

Avoiding Spurious Local Minima in Deep Quadratic Networks

1 code implementation • 31 Dec 2019 • Abbas Kazemipour, Brett W. Larsen, Shaul Druckmann

Despite their practical success, a theoretical understanding of the loss landscape of neural networks has proven challenging due to the high-dimensional, non-convex, and highly nonlinear structure of such models.

Unmasking the Lottery Ticket Hypothesis: What's Encoded in a Winning Ticket's Mask?

no code implementations • 6 Oct 2022 • Mansheej Paul, Feng Chen, Brett W. Larsen, Jonathan Frankle, Surya Ganguli, Gintare Karolina Dziugaite

Third, we show how the flatness of the error landscape at the end of training determines a limit on the fraction of weights that can be pruned at each iteration of IMP.
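One back-of-the-envelope reading of that connection (my own illustration, not the paper's precise statement): pruning a fraction $p$ of the weights at the end of training perturbs the solution $\theta^{*}$ by zeroing its smallest-magnitude coordinates, and a second-order expansion ties the resulting error increase to the local curvature,

\[
L(\theta^{*} + \Delta\theta) - L(\theta^{*}) \;\approx\; \tfrac{1}{2}\,\Delta\theta^{\top} H\,\Delta\theta \;\le\; \tfrac{1}{2}\,\lambda_{\max}(H)\,\lVert\Delta\theta\rVert^{2},
\qquad \Delta\theta = -\,\theta^{*}\odot(1 - m_p),
\]

so the flatter the error landscape (the smaller the curvature), the larger the fraction $p$ whose removal still keeps the error within the trained solution's basin.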

Duality of Bures and Shape Distances with Implications for Comparing Neural Representations

no code implementations • 19 Nov 2023 • Sarah E. Harvey, Brett W. Larsen, Alex H. Williams

A multitude of (dis)similarity measures between neural network representations have been proposed, resulting in a fragmented research landscape.
