Denoising Diffusion Probabilistic Models

NeurIPS 2020  ·  Jonathan Ho, Ajay Jain, Pieter Abbeel ·

We present high quality image synthesis results using diffusion probabilistic models, a class of latent variable models inspired by considerations from nonequilibrium thermodynamics. Our best results are obtained by training on a weighted variational bound designed according to a novel connection between diffusion probabilistic models and denoising score matching with Langevin dynamics, and our models naturally admit a progressive lossy decompression scheme that can be interpreted as a generalization of autoregressive decoding. On the unconditional CIFAR10 dataset, we obtain an Inception score of 9.46 and a state-of-the-art FID score of 3.17. On 256x256 LSUN, we obtain sample quality similar to ProgressiveGAN. Our implementation is available at https://github.com/hojonathanho/diffusion

PDF Abstract NeurIPS 2020 PDF NeurIPS 2020 Abstract

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Density Estimation CIFAR-10 DDPM NLL (bits/dim) 3.69 # 11
Image Generation CIFAR-10 Denoising Diffusion Inception score 9.46 # 17
FID 3.17 # 45
bits/dimension 3.75 # 68
Image Generation CIFAR-10 DDPM (L) FID 13.51 # 106
bits/dimension 3.7 # 67
Image Generation LSUN Bedroom Denoising Diffusion Probabilistic Model FID-50k 4.9 # 2
Image Generation LSUN Bedroom 256 x 256 Denoising Diffusion Probabilistic Model FID 6.36 # 14
Image Generation LSUN Bedroom 256 x 256 Denoising Diffusion Probabilistic Model (large) FID 4.9 # 11
Image Generation LSUN Bedroom 256 x 256 Denoising Diffusion Probabilistic Model (large, DINOv2) FD 229.76 # 3
Precision 0.79 # 6
Recall 0.61 # 3
Image Generation LSUN Cat 256 x 256 Denoising Diffusion Probabilistic Model FID 19.75 # 6
Image Generation LSUN Churches 256 x 256 Denoising Diffusion Probabilistic Model FID 7.89 # 21

Methods