Image Generation

2001 papers with code • 85 benchmarks • 67 datasets

Image Generation (synthesis) is the task of generating new images from an existing dataset.

Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Benchmarks

Add a Result

These leaderboards are used to track progress in Image Generation

Dataset	Best Model	Compare
CIFAR-10	StyleSAN-XL	See all
ImageNet 64x64	RIN	See all
ImageNet 256x256	ViT-XL/2 with limited Interval Guidance	See all
FFHQ 256 x 256	StyleSAN-XL	See all
CelebA 64x64	DDPM-IP	See all
LSUN Bedroom 256 x 256	Diffusion ProjectedGAN	See all
ImageNet 32x32	StyleGAN-XL	See all
STL-10	Diffusion ProjectedGAN	See all
LSUN Churches 256 x 256	Projected GAN	See all
ImageNet 512x512	EDM2-XXL	See all
FFHQ 1024 x 1024	StyleSAN-XL	See all
CelebA 256x256	Efficient-VDVAE	See all
ImageNet 128x128	VDM++	See all
CelebA-HQ 256x256	RDM	See all
FFHQ-U	Alias-Free-R	See all
MNIST	Locally Masked PixelCNN (8 orders)	See all
CelebA-HQ 1024x1024	RDM	See all
Binarized MNIST	CR-NVAE	See all
LSUN Cat 256 x 256	Vision-aided GAN	See all
CelebA-HQ 128x128	U-Net GAN	See all
CIFAR-100	LeCAM (StyleGAN2 + ADA)	See all
AFHQV2	Polarity-StyleGAN3	See all
AFHQ Cat	Vision-aided GAN	See all
LSUN Horse 256 x 256	Vision-aided GAN	See all
CLEVR	Projected GAN	See all
Cityscapes	Projected GAN	See all
AFHQ Dog	Projected GAN	See all
Fashion-MNIST	PAE	See all
CelebA 128x128	U-Net GAN	See all
AFHQ Wild	Vision-aided GAN	See all
Places50	SinDiffusion	See all
CUB 128 x 128	Projected GAN	See all
Stanford Dogs	Projected GAN	See all
Stanford Cars	Projected GANs	See all
Pokemon 256x256	StyleGAN-XL	See all
VizDoom	GAUDI	See all
Replica	GAUDI	See all
VLN-CE	GAUDI	See all
ARKitScenes	GAUDI	See all
CAT 256x256	StyleGAN2 + DA + RLC (Ours)	See all
ADE-Indoor	Projected GAN	See all
Stacked MNIST	VAEBM	See all
CelebA-HQ 64x64	VAEBM	See all
CIFAR-10 (20% data)	DiffAugment-CR-BigGAN	See all
CIFAR-10 (10% data)	DiffAugment-StyleGAN2	See all
LSUN Bedroom	StyleGAN	See all
FFHQ 512 x 512	StyleSAN-XL	See all
FFHQ 128 x 128	DDPM-IP	See all
ObjectsRoom	GENESIS-V2	See all
ShapeStacks	GENESIS-V2	See all
MetFaces-U	Alias-Free-R	See all
MetFaces	t-Stylegan3-ada (NVIDIA pre-trained)	See all
Pokemon 1024x1024	StyleGAN-XL	See all
Oxford 102 Flowers 256 x 256	Projected GAN	See all
LSUN Car 512 x 384	Polarity-StyleGAN2	See all
LSUN Bedroom 64 x 64	WGAN-GP + TT Update Rule	See all
LSUN Bedroom 128 x 128	LadaGAN	See all
RC-49	cDRE-F-cSP+RS	See all
iNaturalist 2019	StyeGAN2 + NoisyTwins	See all
Cityscapes-5K 256x512	SB-GAN	See all
Cityscapes-25K 256x512	SB-GAN	See all
Indian Celebs 256 x 256	MSG-StyleGAN	See all
LSUN Car 256 x 256	StyleGAN2	See all
Multi-dSprites	GENESIS	See all
GQN	GENESIS	See all
Landscapes 256 x 256	CIPS	See all
Satellite-Buildings 256 x 256	CIPS	See all
Satellite-Landscapes 256 x 256	CIPS	See all
Oxford 102 Flowers 128x128	QSNGAN	See all
25% ImageNet 128x128	LeCAM + DA	See all
LLVIP	pix2pix	See all
SDSS Galaxies	AstroDDPM	See all
NASA Perseverance	Stylegan2-ada	See all
1,078 People 3D Faces Collection Data	Sessiz çığlık	See all
LSUN	BigGAN + gSR	See all
CelebA-HQ 512x512	RDM	See all
CelebA	PR-BigGAN - Recall	See all
LSUN tower 64x64	DDPM-IP	See all
FFHQ 64x64 - 4x upscaling	PFGM++	See all
KMNIST	Spiking-Diffusion	See all
EMNIST-Letters	Spiking-Diffusion	See all
ImageNet 256x256 - 1 labeled data per class	DPT	See all
ImageNet 256x256 - 2 labeled data per class	DPT	See all
ImageNet 256x256 - 5 labeled data per class	DPT	See all
ImageNet 256x256 - 1% labeled data	DPT	See all

Show all 85 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Image Generation models and implementations

open-mmlab/mmgeneration

9 papers

1,809

faceonlive/ai-research

9 papers

195

eriklindernoren/PyTorch-GAN

6 papers

15,742

stability-ai/generative-models

5 papers

22,394

See all 8 libraries.

Datasets

Subtasks

Conditional Image Generation

3D-Aware Image Synthesis

Facial Inpainting

Layout-to-Image Generation

ROI-based image generation

Image Generation from Scene Graphs

Pose-Guided Image Generation

User Constrained Thumbnail Generation

Handwritten Word Generation

Chinese Landscape Painting Generation

person reposing

Infinite Image Generation

Multi class one-shot image synthesis

Single class few-shot image synthesis

Latest papers with no code

Most implemented Social Latest No code

InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation

no code yet • 30 Apr 2024

Additionally, our masked cross-attention mechanism enables the precise control of multi-ID and composition in the generated images.

Paper
Add Code

Hide and Seek: How Does Watermarking Impact Face Recognition?

no code yet • 29 Apr 2024

The recent progress in generative models has revolutionized the synthesis of highly realistic images, including face images.

Paper
Add Code

Anywhere: A Multi-Agent Framework for Reliable and Diverse Foreground-Conditioned Image Inpainting

no code yet • 29 Apr 2024

In the image generation module, we employ a text-guided canny-to-image generation model to create a template image based on the edge map of the foreground image and language prompts, and an image refiner to produce the outcome by blending the input foreground and the template image.

Paper
Add Code

Autonomous Quality and Hallucination Assessment for Virtual Tissue Staining and Digital Pathology

no code yet • 29 Apr 2024

Here, we present an autonomous quality and hallucination assessment method (termed AQuA), mainly designed for virtual tissue staining, while also being applicable to histochemical staining.

Paper
Add Code

PKU-AIGIQA-4K: A Perceptual Quality Assessment Database for Both Text-to-Image and Image-to-Image AI-Generated Images

no code yet • 29 Apr 2024

This oversight highlights a critical gap in the current research landscape, underscoring the need for dedicated databases catering to image-to-image scenarios, as well as more comprehensive databases that encompass a broader range of AI-generated image scenarios.

Paper
Add Code

Learning Mixtures of Gaussians Using Diffusion Models

no code yet • 29 Apr 2024

We give a new algorithm for learning mixtures of $k$ Gaussians (with identity covariance in $\mathbb{R}^n$) to TV error $\varepsilon$, with quasi-polynomial ($O(n^{\text{poly log}\left(\frac{n+k}{\varepsilon}\right)})$) time and sample complexity, under a minimum weight assumption.

Paper
Add Code

Grounded Compositional and Diverse Text-to-3D with Pretrained Multi-View Diffusion Model

no code yet • 28 Apr 2024

In this paper, we propose an effective two-stage approach named Grounded-Dreamer to generate 3D assets that can accurately follow complex, compositional text prompts while achieving high fidelity by using a pre-trained multi-view diffusion model.

Paper
Add Code

Fisher Information Improved Training-Free Conditional Diffusion Model

no code yet • 28 Apr 2024

Recently, the diffusion model with the training-free methods has succeeded in conditional image generation tasks.

Paper
Add Code

Causal Diffusion Autoencoders: Toward Counterfactual Generation via Diffusion Probabilistic Models

no code yet • 27 Apr 2024

We empirically show that CausalDiffAE learns a disentangled latent space and is capable of generating high-quality counterfactual images.

Paper
Add Code

Trinity Detector:text-assisted and attention mechanisms based spectral fusion for diffusion generation image detection

no code yet • 26 Apr 2024

Artificial Intelligence Generated Content (AIGC) techniques, represented by text-to-image generation, have led to a malicious use of deep forgeries, raising concerns about the trustworthiness of multimedia content.

Paper
Add Code

Image Generation

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers with no code

Content

Benchmarks

Add a Result