Wavelet Diffusion Models are fast and scalable Image Generators

CVPR 2023  ·  Hao Phung, Quan Dao, Anh Tran ·

Diffusion models are emerging as a powerful solution for high-fidelity image generation, exceeding GANs in quality in many circumstances. However, their slow training and inference speed is a huge bottleneck, blocking them from being used in real-time applications. The recent DiffusionGAN method significantly decreases the models' running time by reducing the number of sampling steps from thousands to several, but its speed still lags far behind its GAN counterparts. This paper aims to reduce the speed gap by proposing a novel wavelet-based diffusion scheme. We extract low- and high-frequency components at both the image and feature levels via wavelet decomposition and adaptively handle these components for faster processing while maintaining good generation quality. Furthermore, we propose a reconstruction term, which effectively boosts the model's training convergence. Experimental results on CelebA-HQ, CIFAR-10, LSUN-Church, and STL-10 datasets prove our solution is a stepping stone toward real-time, high-fidelity diffusion models. Our code and pre-trained checkpoints are available at \url{https://github.com/VinAIResearch/WaveDiff.git}.
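As a sketch of the image-level wavelet decomposition the abstract describes, the snippet below implements a single level of the 2D Haar transform in NumPy: it splits an image into a half-resolution low-frequency subband (LL) and three high-frequency detail subbands (LH, HL, HH), and reconstructs the image exactly from them. This is a minimal illustration only; the paper's actual implementation, filter choice, and normalization may differ.

```python
import numpy as np

def haar_dwt2(x):
    """One level of 2D Haar wavelet decomposition.

    Splits an (H, W) image into four half-resolution subbands:
    LL (low-frequency approximation) and LH/HL/HH (high-frequency details).
    """
    # Pairwise averages and differences along columns, then rows.
    lo = (x[:, 0::2] + x[:, 1::2]) / 2.0
    hi = (x[:, 0::2] - x[:, 1::2]) / 2.0
    ll = (lo[0::2, :] + lo[1::2, :]) / 2.0
    lh = (lo[0::2, :] - lo[1::2, :]) / 2.0
    hl = (hi[0::2, :] + hi[1::2, :]) / 2.0
    hh = (hi[0::2, :] - hi[1::2, :]) / 2.0
    return ll, lh, hl, hh

def haar_idwt2(ll, lh, hl, hh):
    """Inverse Haar transform: perfectly reconstructs the original image."""
    h, w = ll.shape
    lo = np.empty((2 * h, w)); hi = np.empty((2 * h, w))
    lo[0::2, :], lo[1::2, :] = ll + lh, ll - lh
    hi[0::2, :], hi[1::2, :] = hl + hh, hl - hh
    x = np.empty((2 * h, 2 * w))
    x[:, 0::2], x[:, 1::2] = lo + hi, lo - hi
    return x
```

Because each subband has a quarter of the original pixels, feeding the four subbands (stacked as channels) to the generator processes a 4x smaller spatial resolution, which is the source of the speed-up the paper targets.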


Results from the Paper

| Task             | Dataset               | Model    | Metric | Metric Value | Global Rank |
|------------------|-----------------------|----------|--------|--------------|-------------|
| Image Generation | CelebA-HQ 1024x1024   | WaveDiff | FID    | 5.98         | # 4         |
|                  |                       |          | NFE    | 2            | # 1         |
| Image Generation | CelebA-HQ 256x256     | WaveDiff | FID    | 5.94         | # 5         |
|                  |                       |          | Recall | 0.37         | # 2         |
|                  |                       |          | NFE    | 2            | # 1         |
| Image Generation | CelebA-HQ 512x512     | WaveDiff | FID    | 6.40         | # 1         |
|                  |                       |          | Recall | 0.35         | # 1         |
|                  |                       |          | NFE    | 2            | # 1         |
| Image Generation | CIFAR-10              | WaveDiff | FID    | 4.01         | # 55        |
|                  |                       |          | Recall | 0.55         | # 4         |
|                  |                       |          | NFE    | 4            | # 9         |
| Image Generation | LSUN Churches 256x256 | WaveDiff | FID    | 5.06         | # 16        |
|                  |                       |          | Recall | 0.40         | # 1         |
|                  |                       |          | NFE    | 4            | # 1         |
| Image Generation | STL-10                | WaveDiff | FID    | 12.93        | # 4         |
|                  |                       |          | Recall | 0.41         | # 1         |
|                  |                       |          | NFE    | 4            | # 1         |