Search Results for author: Matthew Leavitt

Found 3 papers, 1 papers with code

On the special role of class-selective neurons in early training

no code implementations • 27 May 2023 • Omkar Ranadive, Nikhil Thakurdesai, Ari S Morcos, Matthew Leavitt, Stéphane Deny

Finally, in causal experiments where we regularize against class selectivity at different points in training, we show that the presence of class-selective neurons early in training is critical to the successful training of the network; in contrast, class-selective neurons can be suppressed later in training with little effect on final accuracy.

Paper
Add Code

Knowledge Distillation for Efficient Sequences of Training Runs

no code implementations • 11 Mar 2023 • Xingyu Liu, Alex Leonardi, Lu Yu, Chris Gilmer-Hill, Matthew Leavitt, Jonathan Frankle

We find that augmenting future runs with KD from previous runs dramatically reduces the time necessary to train these models, even taking into account the overhead of KD.

Knowledge Distillation

Paper
Add Code

ConViT: Improving Vision Transformers with Soft Convolutional Inductive Biases

9 code implementations • 19 Mar 2021 • Stéphane d'Ascoli, Hugo Touvron, Matthew Leavitt, Ari Morcos, Giulio Biroli, Levent Sagun

We initialise the GPSA layers to mimic the locality of convolutional layers, then give each attention head the freedom to escape locality by adjusting a gating parameter regulating the attention paid to position versus content information.

Ranked #482 on Image Classification on ImageNet

Image Classification Inductive Bias

29,758

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.