TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Image Generation	CIFAR-10	ViTGAN with StyleGAN2	Inception score	9.89	# 7
Image Generation	CIFAR-10	ViTGAN with StyleGAN2	FID	4.57	# 68
Image Generation	CIFAR-10	ViTGAN	Inception score	9.3	# 20
Image Generation	CIFAR-10	ViTGAN	FID	6.66	# 76

Badge (Markdown)
`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/vitgan-training-gans-with-vision-transformers/image-generation-on-cifar-10)](https://paperswithcode.com/sota/image-generation-on-cifar-10?p=vitgan-training-gans-with-vision-transformers)`

ViTGAN: Training GANs with Vision Transformers

ICLR 2022 · Kwonjoon Lee, Huiwen Chang, Lu Jiang, Han Zhang, Zhuowen Tu, Ce Liu

Recently, Vision Transformers (ViTs) have shown competitive performance on image recognition while requiring fewer vision-specific inductive biases. In this paper, we investigate whether this observation extends to image generation. To this end, we integrate the ViT architecture into generative adversarial networks (GANs). We observe that existing regularization methods for GANs interact poorly with self-attention, causing serious instability during training. To resolve this issue, we introduce novel regularization techniques for training GANs with ViTs. Empirically, our approach, named ViTGAN, achieves performance comparable to the state-of-the-art CNN-based StyleGAN2 on the CIFAR-10, CelebA, and LSUN bedroom datasets.

Code

mlpc-ucsd/ViTGAN official

lucidrains/parti-pytorch

506

wilile26811249/ViTGAN

159

Tasks

Image Generation

Datasets

CIFAR-10

CelebA

LSUN

Results from the Paper

Ranked #68 on Image Generation on CIFAR-10

Methods

Convolution • Leaky ReLU • Path Length Regularization • R1 Regularization • StyleGAN2 • Weight Demodulation
