1 code implementation • NeurIPS 2023 • Ravid Shwartz-Ziv, Micah Goldblum, Yucen Lily Li, C. Bayan Bruss, Andrew Gordon Wilson
Real-world datasets are often highly class-imbalanced, which can adversely impact the performance of deep learning models.
no code implementations • 13 Sep 2023 • Angelica Chen, Ravid Shwartz-Ziv, Kyunghyun Cho, Matthew L. Leavitt, Naomi Saphra
Most interpretability research in NLP focuses on understanding the behavior and features of a fully trained model.
no code implementations • 23 Jun 2023 • Jiachen Zhu, Katrina Evtimova, Yubei Chen, Ravid Shwartz-Ziv, Yann LeCun
In summary, VCReg offers a universally applicable regularization framework that significantly advances transfer learning and highlights the connection between gradient starvation, neural collapse, and feature transferability.
1 code implementation • NeurIPS 2023 • Ido Ben-Shaul, Ravid Shwartz-Ziv, Tomer Galanti, Shai Dekel, Yann LeCun
Self-supervised learning (SSL) is a powerful tool in machine learning, but understanding the learned representations and their underlying mechanisms remains a challenge.
no code implementations • 19 Apr 2023 • Ravid Shwartz-Ziv, Yann LeCun
Information theory, and notably the information bottleneck principle, has been pivotal in shaping deep neural networks.
no code implementations • 1 Mar 2023 • Ravid Shwartz-Ziv, Randall Balestriero, Kenji Kawaguchi, Tim G. J. Rudner, Yann LeCun
In this paper, we provide an information-theoretic perspective on Variance-Invariance-Covariance Regularization (VICReg) for self-supervised learning.
1 code implementation • 12 Oct 2022 • Jonas Geiping, Micah Goldblum, Gowthami Somepalli, Ravid Shwartz-Ziv, Tom Goldstein, Andrew Gordon Wilson
Despite the clear performance benefits of data augmentations, little is known about why they are so effective.
no code implementations • 20 Jul 2022 • Ravid Shwartz-Ziv, Randall Balestriero, Yann LeCun
In this paper, we examine self-supervised learning methods, particularly VICReg, to provide an information-theoretical understanding of their construction.
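As a rough illustrative sketch (not the paper's implementation), the three VICReg terms — invariance between two views' embeddings, a variance hinge that keeps each embedding dimension's standard deviation above a threshold, and a covariance penalty on off-diagonal correlations — can be written in NumPy as follows; the term weights are assumptions chosen for illustration:

```python
import numpy as np

def vicreg_loss(z_a, z_b, sim_w=25.0, var_w=25.0, cov_w=1.0, eps=1e-4):
    """Sketch of a VICReg-style objective for two embedding batches of shape (N, D)."""
    n, d = z_a.shape
    # Invariance: mean squared distance between the two views' embeddings.
    inv = np.mean((z_a - z_b) ** 2)
    # Variance: hinge pushing each dimension's std above 1 (prevents collapse).
    std_a = np.sqrt(z_a.var(axis=0) + eps)
    std_b = np.sqrt(z_b.var(axis=0) + eps)
    var = np.mean(np.maximum(0.0, 1.0 - std_a)) + np.mean(np.maximum(0.0, 1.0 - std_b))
    # Covariance: penalize off-diagonal entries of each view's covariance matrix,
    # decorrelating the embedding dimensions.
    def cov_term(z):
        zc = z - z.mean(axis=0)
        cov = (zc.T @ zc) / (n - 1)
        off = cov - np.diag(np.diag(cov))
        return (off ** 2).sum() / d
    cov = cov_term(z_a) + cov_term(z_b)
    return sim_w * inv + var_w * var + cov_w * cov
```

Collapse of the representation (all embeddings identical) would zero the invariance term but is penalized by the variance hinge, which is the core design idea behind the method.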
1 code implementation • 20 May 2022 • Ravid Shwartz-Ziv, Micah Goldblum, Hossein Souri, Sanyam Kapoor, Chen Zhu, Yann LeCun, Andrew Gordon Wilson
Deep learning is increasingly moving towards a transfer learning paradigm whereby large foundation models are fine-tuned on downstream tasks, starting from an initialization learned on the source task.
no code implementations • 10 Feb 2022 • Ravid Shwartz-Ziv
Then, we propose using the Information Bottleneck (IB) theory to explain deep learning systems.
1 code implementation • ICML Workshop AutoML 2021 • Ravid Shwartz-Ziv, Amitai Armon
A key element in solving real-life data science problems is selecting the types of models to use.
no code implementations • 27 Dec 2020 • Ravid Shwartz-Ziv, Itamar Ben Ari, Amitai Armon
In this work, we present a spatio-temporal convolutional neural network for predicting the future severity of COVID-19-related symptoms in a population, per region, given its past reported symptoms.
1 code implementation • 8 Jun 2020 • Zoe Piran, Ravid Shwartz-Ziv, Naftali Tishby
The Information Bottleneck (IB) framework is a general characterization of optimal representations obtained using a principled approach for balancing accuracy and complexity.
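The accuracy-complexity balance described here is usually stated as a Lagrangian over stochastic encodings of the input; a standard formulation (sketched here, with β the trade-off parameter) is:

```latex
% Information Bottleneck objective: find a representation T of input X
% that is maximally compressive (small I(X;T)) while staying
% informative about the target Y (large I(T;Y)).
\min_{p(t \mid x)} \; \mathcal{L}_{\mathrm{IB}} \;=\; I(X;T) \;-\; \beta\, I(T;Y)
```

Sweeping β traces out the optimal accuracy-complexity frontier that the IB framework characterizes.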
1 code implementation • Approximate Inference (AABI) Symposium 2019 • Ravid Shwartz-Ziv, Alexander A. Alemi
In this preliminary work, we study the generalization properties of infinite ensembles of infinitely wide neural networks.
no code implementations • ICLR 2019 • Ravid Shwartz-Ziv, Amichai Painsky, Naftali Tishby
Specifically, we show that the training of the network is characterized by a rapid increase in the mutual information (MI) between the layers and the target label, followed by a longer decrease in the MI between the layers and the input variable.
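Tracking these information-plane quantities requires estimating mutual information between continuous layer activations and the input or label. As a minimal sketch in the spirit of the discretization-based estimates used in such analyses (the bin count here is an illustrative assumption, not the paper's setting), a histogram plug-in estimator for 1-D variables might look like:

```python
import numpy as np

def mutual_information(x, y, bins=30):
    """Histogram-based plug-in estimate of I(X;Y) in bits for 1-D samples."""
    # Joint distribution from a 2-D histogram of the paired samples.
    joint, _, _ = np.histogram2d(x, y, bins=bins)
    p_xy = joint / joint.sum()
    # Marginals by summing out each axis.
    p_x = p_xy.sum(axis=1, keepdims=True)
    p_y = p_xy.sum(axis=0, keepdims=True)
    # MI = KL(p_xy || p_x p_y); restrict to nonzero joint cells.
    nz = p_xy > 0
    return float((p_xy[nz] * np.log2(p_xy[nz] / (p_x @ p_y)[nz])).sum())
```

Plug-in binning estimators are biased upward for small samples and sensitive to the bin count, which is one reason MI estimation choices matter in information-plane studies.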
no code implementations • 26 Nov 2018 • Itamar Ben-Ari, Ravid Shwartz-Ziv
Our model is shown to be effective in detecting anomalies in videos.
13 code implementations • 2 Mar 2017 • Ravid Shwartz-Ziv, Naftali Tishby
Previous work proposed to analyze DNNs in the Information Plane, i.e., the plane of the mutual information values that each layer preserves on the input and output variables.