Image Generation

2036 papers with code • 85 benchmarks • 67 datasets

Image Generation (synthesis) is the task of generating new images that follow the distribution of an existing dataset.

Unconditional generation refers to sampling from the data distribution without any conditioning input, i.e. $p(y)$, where $y$ is an image.
Conditional image generation (subtask) refers to sampling from the data distribution given a conditioning input $x$ such as a class label, i.e. $p(y|x)$.
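
The difference between $p(y)$ and $p(y|x)$ can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the toy dataset reduces each "image" to a single brightness value, and the generator is a fitted 1-D Gaussian rather than a GAN or diffusion model like those on the leaderboards. The function names are hypothetical.

```python
import random
import statistics

# Toy "dataset" (illustrative assumption): each image is reduced to one
# brightness value y, paired with a class label x.
DATA = [("dark", 0.10), ("dark", 0.20), ("dark", 0.15),
        ("bright", 0.80), ("bright", 0.90), ("bright", 0.85)]


def fit_gaussian(values):
    """Return (mean, std) of a 1-D Gaussian fitted to the values."""
    return statistics.mean(values), statistics.pstdev(values)


def sample_unconditional(data, rng):
    """Draw y ~ p(y): the labels are ignored entirely."""
    mean, std = fit_gaussian([y for _, y in data])
    return rng.gauss(mean, std)


def sample_conditional(data, label, rng):
    """Draw y ~ p(y|x): only examples carrying label x are used."""
    mean, std = fit_gaussian([y for x, y in data if x == label])
    return rng.gauss(mean, std)


rng = random.Random(0)  # fixed seed so repeated runs give the same samples
unconditional = [sample_unconditional(DATA, rng) for _ in range(1000)]
dark = [sample_conditional(DATA, "dark", rng) for _ in range(1000)]
bright = [sample_conditional(DATA, "bright", rng) for _ in range(1000)]

print("mean of p(y) samples:       ", round(statistics.mean(unconditional), 2))
print("mean of p(y|dark) samples:  ", round(statistics.mean(dark), 2))
print("mean of p(y|bright) samples:", round(statistics.mean(bright), 2))
```

Unconditional samples spread over the whole dataset, while conditional samples concentrate around the examples that share the requested label. A single Gaussian is a poor fit for the two-cluster data, which is one reason practical generators use far more expressive models.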

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation and other types of image generation, refer to the subtasks.

(Image credit: StyleGAN)

Benchmarks

These leaderboards are used to track progress in Image Generation.

Dataset	Best Model
CIFAR-10	StyleSAN-XL
ImageNet 64x64	PaGoDA
ImageNet 256x256	PaGoDA
FFHQ 256 x 256	StyleSAN-XL
CelebA 64x64	DDPM-IP
ImageNet 32x32	PaGoDA
ImageNet 512x512	EDM2-XXL w/ guidance interval
LSUN Bedroom 256 x 256	Diffusion ProjectedGAN
STL-10	Diffusion ProjectedGAN
LSUN Churches 256 x 256	Projected GAN
FFHQ 1024 x 1024	StyleSAN-XL
CelebA 256x256	Efficient-VDVAE
ImageNet 128x128	PaGoDA
CelebA-HQ 256x256	RDM
FFHQ-U	Alias-Free-R
MNIST	Locally Masked PixelCNN (8 orders)
CelebA-HQ 1024x1024	RDM
Binarized MNIST	CR-NVAE
LSUN Cat 256 x 256	Vision-aided GAN
CelebA-HQ 128x128	U-Net GAN
CIFAR-100	LeCAM (StyleGAN2 + ADA)
AFHQV2	Polarity-StyleGAN3
AFHQ Cat	Vision-aided GAN
LSUN Horse 256 x 256	Vision-aided GAN
CLEVR	Projected GAN
Cityscapes	Projected GAN
AFHQ Dog	Projected GAN
Fashion-MNIST	PAE
CelebA 128x128	U-Net GAN
AFHQ Wild	Vision-aided GAN
Places50	SinDiffusion
CUB 128 x 128	Projected GAN
Stanford Dogs	Projected GAN
Stanford Cars	Projected GANs
Pokemon 256x256	StyleGAN-XL
VizDoom	GAUDI
Replica	GAUDI
VLN-CE	GAUDI
ARKitScenes	GAUDI
CAT 256x256	StyleGAN2 + DA + RLC (Ours)
ADE-Indoor	Projected GAN
Stacked MNIST	VAEBM
CelebA-HQ 64x64	VAEBM
CIFAR-10 (20% data)	DiffAugment-CR-BigGAN
CIFAR-10 (10% data)	DiffAugment-StyleGAN2
LSUN Bedroom	StyleGAN
FFHQ 512 x 512	StyleSAN-XL
FFHQ 128 x 128	DDPM-IP
ObjectsRoom	GENESIS-V2
ShapeStacks	GENESIS-V2
MetFaces-U	Alias-Free-R
MetFaces	t-Stylegan3-ada (NVIDIA pre-trained)
Pokemon 1024x1024	StyleGAN-XL
Oxford 102 Flowers 256 x 256	Projected GAN
LSUN Car 512 x 384	Polarity-StyleGAN2
LSUN Bedroom 64 x 64	WGAN-GP + TT Update Rule
LSUN Bedroom 128 x 128	LadaGAN
RC-49	cDRE-F-cSP+RS
iNaturalist 2019	StyleGAN2 + NoisyTwins
Cityscapes-5K 256x512	SB-GAN
Cityscapes-25K 256x512	SB-GAN
Indian Celebs 256 x 256	MSG-StyleGAN
LSUN Car 256 x 256	StyleGAN2
Multi-dSprites	GENESIS
GQN	GENESIS
Landscapes 256 x 256	CIPS
Satellite-Buildings 256 x 256	CIPS
Satellite-Landscapes 256 x 256	CIPS
Oxford 102 Flowers 128x128	QSNGAN
25% ImageNet 128x128	LeCAM + DA
LLVIP	pix2pix
SDSS Galaxies	AstroDDPM
NASA Perseverance	Stylegan2-ada
1,078 People 3D Faces Collection Data	Sessiz çığlık
LSUN	BigGAN + gSR
CelebA-HQ 512x512	RDM
CelebA	PR-BigGAN - Recall
LSUN tower 64x64	DDPM-IP
FFHQ 64x64 - 4x upscaling	PFGM++
KMNIST	Spiking-Diffusion
EMNIST-Letters	Spiking-Diffusion
ImageNet 256x256 - 1 labeled data per class	DPT
ImageNet 256x256 - 2 labeled data per class	DPT
ImageNet 256x256 - 5 labeled data per class	DPT
ImageNet 256x256 - 1% labeled data	DPT

Libraries

Use these libraries to find Image Generation models and implementations.

open-mmlab/mmgeneration • 9 papers • 1,833 stars

faceonlive/ai-research • 9 papers • 259 stars

eriklindernoren/PyTorch-GAN • 6 papers • 15,865 stars

stability-ai/generative-models • 5 papers • 22,795 stars

See all 7 libraries.

Datasets

Subtasks

Conditional Image Generation

3D-Aware Image Synthesis

Facial Inpainting

Layout-to-Image Generation

ROI-Based Image Generation

Image Generation from Scene Graphs

Pose-Guided Image Generation

User Constrained Thumbnail Generation

Handwritten Word Generation

Chinese Landscape Painting Generation

Person Reposing

Infinite Image Generation

Multi-Class One-Shot Image Synthesis

Single-Class Few-Shot Image Synthesis

Most implemented papers

Denoising Diffusion Implicit Models

ermongroup/ddim • ICLR 2021

Denoising diffusion probabilistic models (DDPMs) have achieved high quality image generation without adversarial training, yet they require simulating a Markov chain for many steps to produce a sample.

Instance Normalization: The Missing Ingredient for Fast Stylization

DmitryUlyanov/texture_nets • 27 Jul 2016

In this paper, we revisit the fast stylization method introduced in Ulyanov et al.

StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks

hanzhanggit/StackGAN • ICCV 2017

Synthesizing high-quality images from text descriptions is a challenging problem in computer vision and has many practical applications.

Adversarial Audio Synthesis

chrisdonahue/wavegan • ICLR 2019

Audio signals are sampled at high temporal resolutions, and learning to synthesize audio requires capturing structure across a range of timescales.

DRAW: A Recurrent Neural Network For Image Generation

ericjang/draw • 16 Feb 2015

This paper introduces the Deep Recurrent Attentive Writer (DRAW) neural network architecture for image generation.

High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs

NVIDIA/pix2pixHD • CVPR 2018

We present a new method for synthesizing high-resolution photo-realistic images from semantic label maps using conditional generative adversarial networks (conditional GANs).

NICE: Non-linear Independent Components Estimation

vincentstimper/normalizing-flows • 30 Oct 2014

NICE is based on the idea that a good representation is one in which the data has a distribution that is easy to model.

AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks

taoxugit/AttnGAN • CVPR 2018

In this paper, we propose an Attentional Generative Adversarial Network (AttnGAN) that allows attention-driven, multi-stage refinement for fine-grained text-to-image generation.

Pixel Recurrent Neural Networks

EugenHotaj/pytorch-generative • 25 Jan 2016

Modeling the distribution of natural images is a landmark problem in unsupervised learning.

BEGAN: Boundary Equilibrium Generative Adversarial Networks

eriklindernoren/PyTorch-GAN • 31 Mar 2017

We propose a new equilibrium enforcing method paired with a loss derived from the Wasserstein distance for training auto-encoder based Generative Adversarial Networks.
