Maximum Likelihood Training of Implicit Nonlinear Diffusion Models

27 May 2022 · Dongjun Kim, Byeonghu Na, Se Jung Kwon, Dongsoo Lee, Wanmo Kang, Il-Chul Moon

Although diverse variations of diffusion models exist, very few works have investigated extending the linear diffusion into a nonlinear diffusion process. The effect of nonlinearity remains poorly understood, but intuitively there should be diffusion patterns that train the generative distribution towards the data distribution more efficiently. This paper introduces a data-adaptive nonlinear diffusion process for score-based diffusion models. The proposed Implicit Nonlinear Diffusion Model (INDM) learns by combining a normalizing flow with a diffusion process. Specifically, INDM implicitly constructs a nonlinear diffusion on the data space by leveraging a linear diffusion on the latent space through a flow network. This flow network is the key to forming the nonlinear diffusion, as the nonlinearity depends on the flow network. This flexibility brings the training of INDM close to Maximum Likelihood Estimation (MLE), in contrast to the non-MLE training of DDPM++, which turns out to be an inflexible special case of INDM with the flow fixed to an identity mapping. In addition, INDM is robust to the discretization of the sampling procedure. In experiments, INDM achieves a state-of-the-art FID of 1.75 on CelebA. Our code is available at https://github.com/byeonghu-na/INDM.
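To make the core idea concrete, here is a minimal sketch (not the authors' implementation) of how a linear diffusion on a latent space, composed with an invertible flow, induces a nonlinear diffusion on the data space. The `Flow` class and its elementwise parameterization are hypothetical stand-ins for a real normalizing flow, and the VP schedule constants are the common defaults, assumed for illustration only.

```python
# Hypothetical, simplified sketch of the INDM idea: run a linear (VP-style)
# diffusion on the latent z = flow(x); pulling each diffused latent back
# through the inverse flow induces a nonlinear diffusion on the data space.
import torch
import torch.nn as nn

class Flow(nn.Module):
    """Toy invertible map; a real INDM would use a full normalizing flow."""
    def __init__(self, dim):
        super().__init__()
        self.log_scale = nn.Parameter(torch.zeros(dim))  # elementwise log-scale

    def forward(self, x):   # data -> latent
        return x * torch.exp(self.log_scale)

    def inverse(self, z):   # latent -> data
        return z * torch.exp(-self.log_scale)

def vp_diffuse(z0, t, beta_min=0.1, beta_max=20.0):
    """Linear VP forward diffusion on the latent space:
    z_t = mean(t) * z0 + std(t) * noise, with the standard VP schedule."""
    log_mean = -0.25 * t**2 * (beta_max - beta_min) - 0.5 * t * beta_min
    mean = torch.exp(log_mean)
    std = torch.sqrt(1.0 - torch.exp(2.0 * log_mean))
    return mean * z0 + std * torch.randn_like(z0)

flow = Flow(dim=8)
x0 = torch.randn(4, 8)                      # a batch of "data"
z_t = vp_diffuse(flow(x0), t=torch.tensor(0.5))
x_t = flow.inverse(z_t)                     # induced nonlinear diffusion of x0
```

Because the diffusion of x_t is defined only through the flow and the latent diffusion, it is never written down explicitly; this is the sense in which the nonlinear diffusion is "implicit", and fixing the flow to the identity recovers an ordinary linear diffusion on the data space.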


Datasets

CIFAR-10, CelebA 64x64


Results from the Paper


| Task             | Dataset      | Model           | Metric         | Value | Global Rank |
|------------------|--------------|-----------------|----------------|-------|-------------|
| Image Generation | CelebA 64x64 | INDM (VE, FID)  | FID            | 2.54  | # 10        |
| Image Generation | CIFAR-10     | INDM (ST)       | FID            | 3.25  | # 47        |
| Image Generation | CIFAR-10     | INDM (ST)       | bits/dimension | 3.01  | # 33        |
| Image Generation | CIFAR-10     | INDM (NLL)      | FID            | 4.79  | # 71        |
| Image Generation | CIFAR-10     | INDM (NLL)      | bits/dimension | 2.97  | # 29        |
| Image Generation | CIFAR-10     | INDM (FID)      | FID            | 2.28  | # 27        |
| Image Generation | CIFAR-10     | INDM (FID)      | bits/dimension | 3.09  | # 39        |

Results from Other Papers


| Task             | Dataset      | Model           | Metric | Value | Rank |
|------------------|--------------|-----------------|--------|-------|------|
| Image Generation | CelebA 64x64 | INDM (VP, FID)  | FID    | 1.75  | # 4  |
