no code implementations • 13 Nov 2023 • Depen Morwani, Benjamin L. Edelman, Costin-Andrei Oncescu, Rosie Zhao, Sham Kakade
Understanding the internal representations learned by neural networks is a cornerstone challenge in the science of machine learning.
no code implementations • 14 Jun 2023 • Nikhil Vyas, Depen Morwani, Rosie Zhao, Gal Kaplun, Sham Kakade, Boaz Barak
Prior works have ascribed the success of SGD in deep learning to the implicit bias induced by a high learning rate or small batch size ("SGD noise").
no code implementations • NeurIPS 2023 • Nikhil Vyas, Alexander Atanasov, Blake Bordelon, Depen Morwani, Sabarish Sainathan, Cengiz Pehlevan
We call this the bias of narrower width.
1 code implementation • 16 Dec 2020 • Depen Morwani, Rahul Vashisht, Harish G. Ramaswamy
Recent papers have shown that sufficiently overparameterized neural networks can perfectly fit even random labels.
1 code implementation • 24 Oct 2020 • Depen Morwani, Harish G. Ramaswamy
We analyse both standard weight normalization (SWN) and exponential weight normalization (EWN), and show that the gradient flow path with EWN is equivalent to gradient flow on standard networks with an adaptive learning rate.
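The equivalence claimed above can be illustrated with a toy scalar example. The sketch below assumes EWN parameterizes a weight as w = exp(s) and trains s; it is a minimal illustration under that assumption, not the paper's construction. One Euler step of gradient descent in s-space then matches, to first order in the step size, a plain step on w whose learning rate is scaled per-weight by w².

```python
import numpy as np

# Hedged sketch: under the assumed EWN parameterization w = exp(s),
# compare one Euler step of gradient descent on s against a plain
# gradient step on w with an adaptive per-weight rate eta * w**2.
# The toy loss L(w) = (w - 3)^2 and the names below are illustrative.

def L_grad(w):
    return 2.0 * (w - 3.0)  # gradient of L(w) = (w - 3)^2

eta = 1e-3
s = 0.5                     # EWN parameter; the weight is w = exp(s)
w = np.exp(s)

# EWN step in s-space, then map back to weight space.
# Chain rule: dL/ds = exp(s) * L'(w).
s_new = s - eta * np.exp(s) * L_grad(np.exp(s))
w_ewn = np.exp(s_new)

# Plain step on w with adaptive learning rate eta * w**2.
w_adapt = w - eta * w**2 * L_grad(w)

print(w_ewn, w_adapt)       # agree to first order in eta
```

Both updates move the weight by approximately eta · w² · L'(w), which is exactly a gradient step on the standard parameterization with a weight-dependent (adaptive) learning rate.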