1 code implementation • 9 Jan 2024 • Adityanarayanan Radhakrishnan, Mikhail Belkin, Dmitriy Drusvyatskiy
A possible explanation is that common training algorithms for neural networks implicitly perform dimensionality reduction, a process called feature learning.
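A concrete object used in this line of work to quantify such feature learning is the average gradient outer product (AGOP) of a trained model. Below is a minimal PyTorch sketch, assuming a scalar-output model; the architecture and data are illustrative, not the paper's.

```python
import torch

def agop(model, X):
    """Average gradient outer product (AGOP) of a scalar-output model
    over a batch X of shape (n, d). A rapidly decaying eigenspectrum
    means the model is sensitive to only a few input directions,
    i.e. it has implicitly performed dimensionality reduction."""
    X = X.clone().requires_grad_(True)
    out = model(X).sum()                      # sum => per-example gradients
    (grads,) = torch.autograd.grad(out, X)    # (n, d)
    return grads.T @ grads / X.shape[0]       # (d, d)

# illustrative usage with a small MLP
model = torch.nn.Sequential(torch.nn.Linear(10, 64), torch.nn.ReLU(),
                            torch.nn.Linear(64, 1))
M = agop(model, torch.randn(256, 10))
print(torch.linalg.eigvalsh(M))  # a few large eigenvalues => low-rank structure
```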
1 code implementation • 1 Sep 2023 • Daniel Beaglehole, Adityanarayanan Radhakrishnan, Parthe Pandit, Mikhail Belkin
We then demonstrate the generality of our result by using the patch-based AGOP to enable deep feature learning in convolutional kernel machines.
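As a rough illustration of the patch-based variant, one can unfold the model's input gradient into patches and average the outer products over all patch positions. This is a simplified sketch, not the paper's exact construction; in particular, gradients here are taken with respect to pixels and then regrouped into patches.

```python
import torch
import torch.nn.functional as F

def patch_agop(model, X, k=3):
    """Patch-based AGOP sketch for image inputs X of shape (n, c, h, w):
    average outer products of the model's input gradient over all
    k x k patches, giving a (c*k*k, c*k*k) matrix."""
    X = X.clone().requires_grad_(True)
    out = model(X).sum()
    (g,) = torch.autograd.grad(out, X)        # (n, c, h, w)
    patches = F.unfold(g, kernel_size=k)      # (n, c*k*k, num_patches)
    d = patches.shape[1]
    patches = patches.permute(0, 2, 1).reshape(-1, d)  # one row per patch
    return patches.T @ patches / patches.shape[0]

# illustrative usage with a tiny convolutional network
net = torch.nn.Sequential(torch.nn.Conv2d(3, 8, 3, padding=1), torch.nn.ReLU(),
                          torch.nn.Flatten(), torch.nn.Linear(8 * 16 * 16, 1))
M = patch_agop(net, torch.randn(4, 3, 16, 16))
```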
no code implementations • 7 Jun 2023 • Libin Zhu, Chaoyue Liu, Adityanarayanan Radhakrishnan, Mikhail Belkin
In this paper, we first present an explanation for the common occurrence of spikes in the training loss when neural networks are trained with stochastic gradient descent (SGD).
3 code implementations • 28 Dec 2022 • Adityanarayanan Radhakrishnan, Daniel Beaglehole, Parthe Pandit, Mikhail Belkin
In recent years neural networks have achieved impressive results on many technological and scientific tasks.
no code implementations • 1 Nov 2022 • Adityanarayanan Radhakrishnan, Max Ruiz Luyten, Neha Prasad, Caroline Uhler
In this work, we propose a transfer learning framework for kernel methods by projecting and translating the source model to the target task.
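The sketch below shows one way the projection and translation steps could look for kernel ridge regression; the synthetic data and the exact form of both steps are illustrative assumptions, not the paper's precise procedure.

```python
import numpy as np
from sklearn.kernel_ridge import KernelRidge

# hypothetical source/target data; in practice these come from two tasks
rng = np.random.default_rng(0)
Xs, ys = rng.normal(size=(500, 20)), rng.normal(size=500)  # large source task
Xt, yt = rng.normal(size=(50, 20)), rng.normal(size=50)    # small target task

# 1. Train the source kernel model.
src = KernelRidge(kernel="laplacian", alpha=1e-3).fit(Xs, ys)

# 2. "Project": evaluate the source model on target inputs, then fit a
#    simple map from source predictions to target labels.
proj = KernelRidge(kernel="laplacian", alpha=1e-3).fit(
    src.predict(Xt).reshape(-1, 1), yt)

# 3. "Translate": correct the remaining error with a kernel model
#    trained on the target residuals.
resid = yt - proj.predict(src.predict(Xt).reshape(-1, 1))
trans = KernelRidge(kernel="laplacian", alpha=1e-3).fit(Xt, resid)

def predict(X):
    base = proj.predict(src.predict(X).reshape(-1, 1))
    return base + trans.predict(X)
```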
1 code implementation • 24 May 2022 • Libin Zhu, Chaoyue Liu, Adityanarayanan Radhakrishnan, Mikhail Belkin
While neural networks can be approximated by linear models as their width increases, certain properties of wide neural networks cannot be captured by linear models.
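A natural candidate beyond the linear (NTK) approximation is the second-order Taylor expansion of the network around its initialization; a standard way to write such a quadratic model, assuming the usual second-order setup:

```latex
% Quadratic model of a network f around initialization w_0; the linear
% (NTK) model keeps only the first two terms.
f_{\mathrm{quad}}(w; x) = f(w_0; x) + \nabla_w f(w_0; x)^\top (w - w_0)
  + \tfrac{1}{2}\, (w - w_0)^\top \nabla_w^2 f(w_0; x) \, (w - w_0)
```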
no code implementations • 29 Apr 2022 • Adityanarayanan Radhakrishnan, Mikhail Belkin, Caroline Uhler
In this work, we identify and construct an explicit set of neural network classifiers that achieve optimality.
no code implementations • 30 Dec 2021 • Adityanarayanan Radhakrishnan, Mikhail Belkin, Caroline Uhler
Establishing a fast rate of convergence for optimization methods is crucial to their applicability in practice.
1 code implementation • 31 Jul 2021 • Adityanarayanan Radhakrishnan, George Stefanakis, Mikhail Belkin, Caroline Uhler
Remarkably, taking the width of a neural network to infinity allows for improved computational performance.
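The computational gain comes from the fact that an infinitely wide network reduces to kernel regression with its neural tangent kernel (NTK), which for a one-hidden-layer ReLU network has a well-known closed form (up to normalization). A sketch on synthetic data; the data and ridge parameter are illustrative:

```python
import numpy as np

def relu_ntk(X, Z):
    """Closed-form NTK of a one-hidden-layer ReLU network (up to an
    overall scaling): Theta(x, z) = ||x|| ||z|| k1(u) + <x, z> k0(u),
    where u is the cosine of the angle between x and z."""
    nx = np.linalg.norm(X, axis=1, keepdims=True)   # (n, 1)
    nz = np.linalg.norm(Z, axis=1, keepdims=True)   # (m, 1)
    u = np.clip(X @ Z.T / (nx * nz.T), -1.0, 1.0)
    k0 = (np.pi - np.arccos(u)) / np.pi
    k1 = (u * (np.pi - np.arccos(u)) + np.sqrt(1 - u ** 2)) / np.pi
    return (nx * nz.T) * k1 + (X @ Z.T) * k0

# kernel ridge regression with the exact infinite-width kernel
rng = np.random.default_rng(0)
X, y = rng.normal(size=(200, 5)), rng.normal(size=200)
Xtest = rng.normal(size=(50, 5))
alpha = np.linalg.solve(relu_ntk(X, X) + 1e-6 * np.eye(len(X)), y)
preds = relu_ntk(Xtest, X) @ alpha
```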
no code implementations • 29 Jun 2021 • Saachi Jain, Adityanarayanan Radhakrishnan, Caroline Uhler
Aligned latent spaces, where meaningful semantic shifts in the input space correspond to a translation in the embedding space, play an important role in the success of downstream tasks such as unsupervised clustering and data imputation.
no code implementations • 1 Jan 2021 • Adityanarayanan Radhakrishnan, Neha Prasad, Caroline Uhler
While deep networks have produced state-of-the-art results in several domains from image classification to machine translation, hyper-parameter selection remains a significant computational bottleneck.
no code implementations • 19 Oct 2020 • Eshaan Nichani, Adityanarayanan Radhakrishnan, Caroline Uhler
We then present a novel linear regression framework for characterizing the impact of depth on test risk, and show that increasing depth leads to a U-shaped test risk for the linear CNTK.
no code implementations • 28 Sep 2020 • Eshaan Nichani, Adityanarayanan Radhakrishnan, Caroline Uhler
Recent work provided an explanation for this phenomenon by introducing the double descent curve, showing that increasing model capacity past the interpolation threshold leads to a decrease in test error.
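A minimal random-features demonstration of the curve (an illustrative setup, not the paper's experiment): min-norm least squares on random ReLU features, with test error peaking near the interpolation threshold p ≈ n and descending again beyond it.

```python
import numpy as np

rng = np.random.default_rng(0)
n, d, n_test = 100, 20, 1000
w_star = rng.normal(size=d)
X, Xt = rng.normal(size=(n, d)), rng.normal(size=(n_test, d))
y = X @ w_star + 0.5 * rng.normal(size=n)   # noisy training labels
yt = Xt @ w_star

for p in [10, 50, 90, 100, 110, 200, 1000]:   # number of random ReLU features
    W = rng.normal(size=(d, p))
    Phi, Phit = np.maximum(X @ W, 0), np.maximum(Xt @ W, 0)
    beta = np.linalg.pinv(Phi) @ y            # min-norm least-squares solution
    err = np.mean((Phit @ beta - yt) ** 2)
    print(f"p={p:5d}  test MSE={err:.3f}")    # peaks near p ~ n, then descends
```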
no code implementations • 28 Sep 2020 • Adityanarayanan Radhakrishnan, Mikhail Belkin, Caroline Uhler
The following questions are fundamental to understanding the properties of over-parameterization in modern machine learning: (1) Under what conditions and at what rate does training converge to a global minimum?
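For question (1), a standard sufficient condition in this literature is the Polyak-Lojasiewicz (PL) inequality, stated here as context together with the classical convergence rate it implies under smoothness:

```latex
% PL condition: for some \mu > 0 and all w,
\tfrac{1}{2}\, \lVert \nabla L(w) \rVert^2 \;\ge\; \mu \left( L(w) - L^* \right).
% Under the PL condition and \beta-smoothness of L, gradient descent with
% step size 1/\beta converges linearly to a global minimum:
L(w_t) - L^* \;\le\; \left( 1 - \tfrac{\mu}{\beta} \right)^{t} \left( L(w_0) - L^* \right).
```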
no code implementations • 18 Sep 2020 • Adityanarayanan Radhakrishnan, Mikhail Belkin, Caroline Uhler
Generalized mirror descent (GMD) subsumes popular first-order optimization methods including gradient descent, mirror descent, and preconditioned gradient descent methods such as Adagrad.
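A sketch of the mirror descent update and how the listed methods arise as special cases; the mirror maps below are standard textbook examples, and the function names are illustrative rather than the paper's notation.

```python
import numpy as np

def gmd_step(w, grad_L, eta, grad_psi, grad_psi_inv):
    """One mirror descent step in the generalized setting:
        grad_psi(w_next) = grad_psi(w) - eta * grad_L(w)
    grad_psi is the mirror map and grad_psi_inv its inverse."""
    return grad_psi_inv(grad_psi(w) - eta * grad_L)

# Special cases (sketch):
#   gradient descent:        grad_psi = identity
#   preconditioned GD:       grad_psi(w) = P @ w  for positive definite P
#   exponentiated gradient:  grad_psi(w) = log(w)  (entropy-type mirror map,
#                            giving an unnormalized multiplicative update)

w = np.ones(5) / 5
grad_L = np.array([0.1, -0.2, 0.0, 0.3, -0.1])

# plain gradient descent as a special case
w_gd = gmd_step(w, grad_L, 0.1, lambda v: v, lambda v: v)

# mirror descent with the entropy-type potential (multiplicative update)
w_md = gmd_step(w, grad_L, 0.1, lambda v: np.log(v), lambda v: np.exp(v))
```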
no code implementations • 13 Mar 2020 • Adityanarayanan Radhakrishnan, Eshaan Nichani, Daniel Bernstein, Caroline Uhler
We define alignment for fully connected networks with multidimensional outputs and show that it is a natural extension of alignment in networks with 1-dimensional outputs as defined by Ji and Telgarsky (2018).
1 code implementation • 26 Sep 2019 • Adityanarayanan Radhakrishnan, Mikhail Belkin, Caroline Uhler
Identifying computational mechanisms for memorization and retrieval of data is a long-standing problem at the intersection of machine learning and neuroscience.
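In this line of work, the proposed mechanism is that overparameterized autoencoders store training examples as attractors of the learned map, so retrieval amounts to iterating the network from a corrupted input. A hedged toy sketch; whether a given example actually becomes an attractor depends on training details:

```python
import torch

torch.manual_seed(0)
X = torch.randn(5, 20)                      # five "training images"
ae = torch.nn.Sequential(torch.nn.Linear(20, 256), torch.nn.ReLU(),
                         torch.nn.Linear(256, 20))
opt = torch.optim.Adam(ae.parameters(), lr=1e-3)
for _ in range(5000):                       # train to near-zero reconstruction loss
    opt.zero_grad()
    loss = ((ae(X) - X) ** 2).mean()
    loss.backward()
    opt.step()

z = X[0] + 0.1 * torch.randn(20)            # corrupted query
with torch.no_grad():
    for _ in range(100):                    # retrieval = iterating the map
        z = ae(z)
print(torch.norm(z - X[0]))                 # small if X[0] is an attractor
```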
no code implementations • ICLR 2019 • Adityanarayanan Radhakrishnan, Caroline Uhler, Mikhail Belkin
In this paper, we link memorization of images in deep convolutional autoencoders to downsampling through strided convolution.
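For reference, the downsampling mechanism in question is simply convolution with stride greater than 1, which reduces spatial resolution inside the network:

```python
import torch

# A stride-2 convolution halves each spatial dimension: an explicit
# dimensionality reduction built into the architecture.
x = torch.randn(1, 3, 32, 32)
conv = torch.nn.Conv2d(3, 8, kernel_size=3, stride=2, padding=1)
print(conv(x).shape)   # torch.Size([1, 8, 16, 16])
```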
no code implementations • ICML Workshop Deep Phenomena 2019 • Adityanarayanan Radhakrishnan, Karren Yang, Mikhail Belkin, Caroline Uhler
The ability of deep neural networks to generalize well in the overparameterized regime has become a subject of significant research interest.
no code implementations • 23 May 2017 • Adityanarayanan Radhakrishnan, Charles Durham, Ali Soylemezoglu, Caroline Uhler
Understanding how a complex machine learning model makes a classification decision is essential for its acceptance in sensitive areas such as health care.