TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Image Generation	Binarized MNIST	CR-NVAE	nats	76.93	# 1
Image Generation	CelebA 64x64	CR-NVAE	bits/dimension	1.86	# 2
Image Generation	CIFAR-10	CR-NVAE	bits/dimension	2.51	# 4
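
The table reports Binarized MNIST in nats and the other two datasets in bits per dimension. As a rough aid for relating the two units, the sketch below converts between them. It assumes the common convention that bits/dimension is the per-example negative log-likelihood in nats divided by (number of data dimensions × ln 2). The function names and the dimension counts (784 for 28x28 MNIST, 3072 for 32x32x3 CIFAR-10) are illustrative assumptions, not taken from the paper.

```python
import math

def nats_to_bits_per_dim(nats, num_dims):
    """Convert a per-example value in nats to bits per dimension."""
    return nats / (num_dims * math.log(2))

def bits_per_dim_to_nats(bpd, num_dims):
    """Convert bits per dimension back to a per-example value in nats."""
    return bpd * num_dims * math.log(2)

# Assumed input sizes (hypothetical constants for illustration).
MNIST_DIMS = 28 * 28          # 784 pixels
CIFAR10_DIMS = 32 * 32 * 3    # 3072 colour values

mnist_bpd = nats_to_bits_per_dim(76.93, MNIST_DIMS)
cifar_nats = bits_per_dim_to_nats(2.51, CIFAR10_DIMS)

print(f"Binarized MNIST: 76.93 nats is about {mnist_bpd:.4f} bits/dim")
print(f"CIFAR-10: 2.51 bits/dim is about {cifar_nats:.1f} nats per image")
```

The conversion only changes units; values from different datasets remain incomparable because the data distributions differ.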

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/consistency-regularization-for-variational/image-generation-on-binarized-mnist)](https://paperswithcode.com/sota/image-generation-on-binarized-mnist?p=consistency-regularization-for-variational)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/consistency-regularization-for-variational/image-generation-on-celeba-64x64)](https://paperswithcode.com/sota/image-generation-on-celeba-64x64?p=consistency-regularization-for-variational)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/consistency-regularization-for-variational/image-generation-on-cifar-10)](https://paperswithcode.com/sota/image-generation-on-cifar-10?p=consistency-regularization-for-variational)`

Consistency Regularization for Variational Auto-Encoders

NeurIPS 2021 · Samarth Sinha, Adji B. Dieng

Variational auto-encoders (VAEs) are a powerful approach to unsupervised learning. They enable scalable approximate posterior inference in latent-variable models using variational inference (VI). A VAE posits a variational family parameterized by a deep neural network, called an encoder, that takes data as input. This encoder is shared across all observations, which amortizes the cost of inference. However, the encoder of a VAE has the undesirable property that it maps a given observation and a semantics-preserving transformation of it to different latent representations. This "inconsistency" of the encoder lowers the quality of the learned representations, especially for downstream tasks, and also negatively affects generalization. In this paper, we propose a regularization method to enforce consistency in VAEs. The idea is to minimize the Kullback-Leibler (KL) divergence between the variational distribution when conditioning on the observation and the variational distribution when conditioning on a random semantics-preserving transformation of this observation. This regularization is applicable to any VAE. In our experiments, we apply it to four different VAE variants on several benchmark datasets and find that it always improves the quality of the learned representations and also leads to better generalization. In particular, when applied to the Nouveau Variational Auto-Encoder (NVAE), our regularization method yields state-of-the-art performance on MNIST and CIFAR-10. We also apply our method to 3D data and find that it learns representations of superior quality, as measured by accuracy on a downstream classification task.
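
The regularizer described in the abstract can be illustrated with a minimal sketch, assuming a diagonal-Gaussian variational family so that the KL divergence has a closed form. Everything named here is hypothetical: `toy_encoder` is a hand-written stand-in for a trained network, `horizontal_flip` and `shift` stand in for semantics-preserving transformations, and the direction of the KL (transformed input first, original second) is an assumption, since the abstract does not fix it. This is not the authors' implementation.

```python
import math

def kl_diag_gaussians(mu_p, logvar_p, mu_q, logvar_q):
    """Closed-form KL( N(mu_p, diag(exp(logvar_p))) || N(mu_q, diag(exp(logvar_q))) )."""
    total = 0.0
    for mp, lp, mq, lq in zip(mu_p, logvar_p, mu_q, logvar_q):
        total += 0.5 * (lq - lp + (math.exp(lp) + (mp - mq) ** 2) / math.exp(lq) - 1.0)
    return total

def toy_encoder(x):
    """Hypothetical encoder: maps a list of floats to (mean, log-variance) of a 2-D latent."""
    mu = [sum(x) / len(x), max(x) - min(x)]
    logvar = [-1.0, -1.0]
    return mu, logvar

def horizontal_flip(x):
    """Stand-in transformation: reverse the order of the values."""
    return list(reversed(x))

def shift(x, delta=0.1):
    """Stand-in transformation: add a constant offset to every value."""
    return [v + delta for v in x]

def consistency_penalty(x, transform, encoder=toy_encoder):
    """KL between the encoder's distribution for transform(x) and for x (assumed direction)."""
    mu_x, logvar_x = encoder(x)
    mu_t, logvar_t = encoder(transform(x))
    return kl_diag_gaussians(mu_t, logvar_t, mu_x, logvar_x)

x = [0.2, 0.5, 0.9]
print("penalty under flip: ", consistency_penalty(x, horizontal_flip))
print("penalty under shift:", consistency_penalty(x, shift))
```

The toy encoder is invariant to reversal but not to a constant offset, so the penalty is essentially zero for the flip and positive for the shift. In training, a term of this kind would be added to the VAE objective with a weighting coefficient, so that the encoder is pushed toward assigning similar distributions to an observation and its transformed version.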

Code

sinhasam/crvae official

Tasks

Image Generation

Variational Inference

Datasets

CIFAR-10

CelebA

ShapeNet

Binarized MNIST

Results from the Paper

Ranked #1 on Image Generation on Binarized MNIST

Methods

Variational Inference
