Contrastive Learning Inverts the Data Generating Process

17 Feb 2021  ·  Roland S. Zimmermann, Yash Sharma, Steffen Schneider, Matthias Bethge, Wieland Brendel ·

Contrastive learning has recently seen tremendous success in self-supervised learning. So far, however, it is largely unclear why the learned representations generalize so effectively to a large variety of downstream tasks. We here prove that feedforward models trained with objectives belonging to the commonly used InfoNCE family learn to implicitly invert the underlying generative model of the observed data. While the proofs make certain statistical assumptions about the generative model, we observe empirically that our findings hold even if these assumptions are severely violated. Our theory highlights a fundamental connection between contrastive learning, generative modeling, and nonlinear independent component analysis, thereby furthering our understanding of the learned representations as well as providing a theoretical foundation to derive more effective contrastive losses.

PDF Abstract

Datasets


Introduced in the Paper:

3DIdent

Used in the Paper:

KITTI-Masks

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Benchmark
Disentanglement 3DIdent InfoNCE (Normal, Box) MCC 98.31 # 1
Disentanglement KITTI-Masks InfoNCE (Laplace, Box) MCC 80.9 # 1

Methods