2 code implementations • 2 May 2023 • Jialin Mao, Itay Griniasty, Han Kheng Teoh, Rahul Ramesh, Rubing Yang, Mark K. Transtrum, James P. Sethna, Pratik Chaudhari
We develop information-geometric techniques to analyze the trajectories of the predictions of deep networks during training.
2 code implementations • 31 Oct 2022 • Rahul Ramesh, Jialin Mao, Itay Griniasty, Rubing Yang, Han Kheng Teoh, Mark Transtrum, James P. Sethna, Pratik Chaudhari
We develop information-geometric techniques to understand the representations learned by deep networks when they are trained on different tasks using supervised, meta-, semi-supervised and contrastive learning.
1 code implementation • 21 May 2022 • Zhiqi Bu, Jialin Mao, Shiyun Xu
Large convolutional neural networks (CNNs) can be difficult to train in the differentially private (DP) regime, since the optimization algorithms require a computationally expensive operation known as per-sample gradient clipping.
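As context for the operation named above, here is a minimal NumPy sketch of per-sample gradient clipping as used in DP-SGD-style training (an illustration of the general idea, not this paper's method; the function name and parameters are hypothetical):

```python
import numpy as np

def clipped_noisy_gradient(per_sample_grads, clip_norm=1.0, noise_mult=1.0, rng=None):
    """DP-SGD-style aggregation: rescale each per-sample gradient so its L2 norm
    is at most `clip_norm`, sum the clipped gradients, then add Gaussian noise
    with standard deviation `noise_mult * clip_norm`."""
    rng = np.random.default_rng(0) if rng is None else rng
    norms = np.linalg.norm(per_sample_grads, axis=1, keepdims=True)
    scale = np.minimum(1.0, clip_norm / np.maximum(norms, 1e-12))
    clipped = per_sample_grads * scale            # each row now has norm <= clip_norm
    noise = rng.normal(0.0, noise_mult * clip_norm, size=per_sample_grads.shape[1])
    return clipped.sum(axis=0) + noise

# Two per-sample gradients with norms 5.0 and 0.5; noise disabled for clarity.
grads = np.array([[3.0, 4.0], [0.3, 0.4]])
g = clipped_noisy_gradient(grads, clip_norm=1.0, noise_mult=0.0)
# → [0.9, 1.2]: the first row is scaled down to norm 1, the second is unchanged
```

The cost the abstract refers to comes from needing each sample's gradient individually (the `per_sample_grads` rows) before the sum, rather than the single batch-averaged gradient that standard backpropagation produces.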
1 code implementation • 27 Oct 2021 • Rubing Yang, Jialin Mao, Pratik Chaudhari
This structure is mirrored in a network trained on this data: we show that the Hessian and the Fisher Information Matrix (FIM) have eigenvalues that are spread uniformly over exponentially large ranges.
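To make the FIM concrete, here is a toy sketch (not the paper's experiment) that builds the Fisher Information Matrix of a small softmax model on random data and inspects its eigenvalue spread; all sizes and names are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)
n, d, k = 200, 5, 3                          # samples, input dim, classes
X = rng.normal(size=(n, d))
W = rng.normal(size=(d, k))

# Softmax probabilities for logits = X @ W.
logits = X @ W
p = np.exp(logits - logits.max(axis=1, keepdims=True))
p /= p.sum(axis=1, keepdims=True)

# FIM = E_x E_{y~p} [ grad log p(y|x) grad log p(y|x)^T ], w.r.t. flattened W.
F = np.zeros((d * k, d * k))
for i in range(n):
    for y in range(k):
        e = -p[i].copy(); e[y] += 1.0        # d log p(y|x) / d logits
        g = np.outer(X[i], e).ravel()        # chain rule through logits = X @ W
        F += p[i, y] * np.outer(g, g)
F /= n

eig = np.linalg.eigvalsh(F)
eig = eig[eig > 1e-12]                       # drop near-zero modes
print(eig.max() / eig.min())                 # condition number of the nonzero spectrum
```

Even in toy models the nonzero FIM eigenvalues typically span several orders of magnitude; the paper's claim concerns how uniformly they fill that exponentially large range.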
no code implementations • ICLR 2018 • Sean Welleck, Zixin Yao, Yu Gai, Jialin Mao, Zheng Zhang, Kyunghyun Cho
In this paper, we propose a novel multiset loss function by viewing this problem from the perspective of sequential decision making.
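As a rough illustration of the sequential-decision view (a sketch of the general idea, not the paper's exact loss; the function and variable names are hypothetical), one step can compare the model's class distribution against an oracle that is uniform over the target items not yet predicted:

```python
import numpy as np
from collections import Counter

def multiset_step_loss(logits, remaining):
    """One decoding step: cross-entropy between the model's softmax over classes
    and an oracle distribution that is uniform over the multiset of target items
    still remaining (`remaining` maps class index -> leftover count)."""
    p = np.exp(logits - logits.max())
    p /= p.sum()
    total = sum(remaining.values())
    loss = 0.0
    for c, cnt in remaining.items():
        loss -= (cnt / total) * np.log(p[c] + 1e-12)
    return loss

# With uniform logits over 3 classes and one of each item left, the loss is log 3.
loss = multiset_step_loss(np.zeros(3), Counter({0: 1, 1: 1, 2: 1}))
```

After each step, the predicted item is removed from `remaining`, so the oracle target shifts as decoding proceeds; summing the per-step losses gives a loss over the whole multiset.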
no code implementations • NeurIPS 2017 • Sean Welleck, Jialin Mao, Kyunghyun Cho, Zheng Zhang
Humans process visual scenes selectively and sequentially using attention.