Image-to-Image Translation

491 papers with code • 37 benchmarks • 29 datasets

Image-to-Image Translation is a computer vision and machine learning task in which a model learns a mapping from an input image to an output image. The output serves a specific purpose, such as style transfer, data augmentation, or image restoration.

(Image credit: Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks)

Benchmarks

These leaderboards track progress in Image-to-Image Translation.

Dataset	Best Model
SYNTHIA-to-Cityscapes	HRDA + PiPa
GTAV-to-Cityscapes Labels	MIC
Cityscapes Labels-to-Photo	DP-SIMS (ConvNext-L)
ADE20K Labels-to-Photos	DP-SIMS (ConvNext-L)
COCO-Stuff Labels-to-Photos	DP-SIMS (ConvNext-XL)
ADE20K-Outdoor Labels-to-Photos	DP-GAN
IXI	ResViT
CelebA-HQ	StarGAN v2
Cityscapes-to-Foggy Cityscapes	MIC
Cityscapes Photo-to-Labels	pix2pix
cat2dog	GNR
RaFD	StarGAN
selfie2anime	GNR
LLVIP	pyramidpix2pix
BCI	pyramidpix2pix
horse2zebra	U-GAT-IT
photo2vangogh	U-GAT-IT
zebra2horse	U-GAT-IT
vangogh2photo	U-GAT-IT
SYNTHIA Fall-to-Winter	CyCADA
Aerial-to-Map	cGAN
selfie-to-anime	FQ-GAN
anime-to-selfie	FQ-GAN
AFHQ	StarGAN v2
Deep-Fashion	INADE
Object Transfiguration (sheep-to-giraffe)	InstaGAN
ADE-Indoor Labels-to-Photo	SB-GAN
photo2portrait	U-GAT-IT
dog2cat	U-GAT-IT
portrait2photo	U-GAT-IT
KITTI Object Tracking Evaluation 2012	SRNet
Zebra and Horses	Shared discriminator GAN
Apples and Oranges	Shared discriminator GAN
2017_test set	hi
BRATS	ResViT
AFHQ (Cat to Dog)	EGSDE
AFHQ (Wild to Dog)	EGSDE

Libraries

Use these libraries to find Image-to-Image Translation models and implementations

eriklindernoren/PyTorch-GAN • 8 papers • 15,711

Wenchao-Du/LIR-for-Unsupervised-IR • 3 papers

yaxingwang/SEMIT • 3 papers

ganslate-team/ganslate • 3 papers

See all 16 libraries.

Subtasks

Cross-View Image-to-Image Translation

Fundus to Angiography Generation

Facial Makeup Transfer

Real-to-Cartoon translation

Photo-To-Caricature Translation

Bird View Synthesis

Most implemented papers

Deep Residual Learning for Image Recognition

tensorflow/models • CVPR 2016

Deep residual nets are foundations of our submissions to ILSVRC & COCO 2015 competitions, where we also won the 1st places on the tasks of ImageNet detection, ImageNet localization, COCO detection, and COCO segmentation.

469

Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks

junyanz/pytorch-CycleGAN-and-pix2pix • ICCV 2017

Image-to-image translation is a class of vision and graphics problems where the goal is to learn the mapping between an input image and an output image using a training set of aligned image pairs.
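The cycle consistency named in the paper's title is used when aligned pairs are not available: translating an image to the other domain and back should reproduce the original. The sketch below is a toy illustration of that loss, not the authors' implementation. Images are flat lists of floats, and `G`, `F`, `invert`, and `darken` are hypothetical stand-ins for learned generators.

```python
from typing import Callable, List

Image = List[float]  # toy stand-in: a flattened image with values in [0, 1]


def l1_distance(a: Image, b: Image) -> float:
    """Mean absolute difference between two equally sized images."""
    if len(a) != len(b):
        raise ValueError("images must have the same size")
    return sum(abs(p - q) for p, q in zip(a, b)) / len(a)


def cycle_consistency_loss(
    G: Callable[[Image], Image],  # hypothetical generator X -> Y
    F: Callable[[Image], Image],  # hypothetical generator Y -> X
    x: Image,
    y: Image,
) -> float:
    """Sum of the L1 errors of the round trips x -> G -> F and y -> F -> G."""
    forward = l1_distance(F(G(x)), x)
    backward = l1_distance(G(F(y)), y)
    return forward + backward


def invert(img: Image) -> Image:
    """Toy generator that is its own inverse, so round trips are lossless."""
    return [1.0 - p for p in img]


def darken(img: Image) -> Image:
    """Toy generator that is not invertible by itself, so round trips lose detail."""
    return [0.5 * p for p in img]
```

With `invert` used in both directions the loss is zero, while `darken` in both directions leaves a positive loss because the round trip does not restore the input.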

187

Image-to-Image Translation with Conditional Adversarial Networks

phillipi/pix2pix • CVPR 2017

We investigate conditional adversarial networks as a general-purpose solution to image-to-image translation problems.
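For orientation, the conditional adversarial objective used in this line of work can be written as below. The notation is an assumption chosen for this summary: $x$ is the input image, $y$ the target image, $z$ a noise vector, $G$ the generator, $D$ the discriminator, and $\lambda$ a weight on the L1 reconstruction term.

```latex
\begin{aligned}
\mathcal{L}_{\mathrm{cGAN}}(G, D) &= \mathbb{E}_{x,y}\big[\log D(x, y)\big]
  + \mathbb{E}_{x,z}\big[\log\big(1 - D(x, G(x, z))\big)\big] \\
\mathcal{L}_{L1}(G) &= \mathbb{E}_{x,y,z}\big[\lVert y - G(x, z) \rVert_1\big] \\
G^{*} &= \arg\min_{G}\max_{D}\; \mathcal{L}_{\mathrm{cGAN}}(G, D) + \lambda\,\mathcal{L}_{L1}(G)
\end{aligned}
```

The discriminator sees the input alongside the real or generated output, which is what makes the adversarial term conditional.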

176

StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation

eriklindernoren/PyTorch-GAN • CVPR 2018

To address the limited scalability of existing approaches, which must build a separate model for every pair of image domains, we propose StarGAN, a novel and scalable approach that can perform image-to-image translations for multiple domains using only a single model.
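One way a single generator can serve several domains is to receive the target domain as an extra input. The sketch below shows only that conditioning step, under toy assumptions: an image is a nested list indexed as channel, row, column, and the hypothetical helper `concat_domain_label` appends one constant-valued channel per domain (a spatially replicated one-hot label). The generator network itself is omitted.

```python
from typing import List

Image = List[List[List[float]]]  # toy layout: [channel][row][column]


def one_hot(index: int, num_domains: int) -> List[float]:
    """One-hot encoding of a domain index."""
    if not 0 <= index < num_domains:
        raise ValueError("domain index out of range")
    return [1.0 if i == index else 0.0 for i in range(num_domains)]


def concat_domain_label(image: Image, target_domain: int, num_domains: int) -> Image:
    """Append one constant channel per domain; the target's channel is all ones."""
    height = len(image[0])
    width = len(image[0][0])
    label_channels = [
        [[value] * width for _ in range(height)]
        for value in one_hot(target_domain, num_domains)
    ]
    # Copy the image rows so the caller's data is not shared with the result.
    return [[row[:] for row in channel] for channel in image] + label_channels
```

A one-channel 2x2 image with three possible domains becomes a four-channel input; changing `target_domain` changes only the appended channels, so the same generator weights can be reused for every target.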

U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation

taki0112/UGATIT • ICLR 2020

We propose a novel method for unsupervised image-to-image translation, which incorporates a new attention module and a new learnable normalization function in an end-to-end manner.
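The learnable normalization function is the Adaptive Layer-Instance Normalization (AdaLIN) of the title: it blends instance-normalized and layer-normalized activations with a ratio `rho`, then applies a scale `gamma` and shift `beta`. The following is a minimal sketch under simplifying assumptions, not the repository's code: feature maps are nested lists indexed as channel, then flattened position; `rho` is a single scalar; and `gamma` and `beta` are passed in as plain per-channel lists rather than predicted by a network.

```python
import math
from typing import List

FeatureMap = List[List[float]]  # toy layout: [channel][flattened spatial position]


def _normalize(values: List[float], eps: float) -> List[float]:
    """Shift to zero mean and scale to (roughly) unit variance."""
    mean = sum(values) / len(values)
    var = sum((v - mean) ** 2 for v in values) / len(values)
    return [(v - mean) / math.sqrt(var + eps) for v in values]


def adalin(
    x: FeatureMap,
    gamma: List[float],
    beta: List[float],
    rho: float,
    eps: float = 1e-5,
) -> FeatureMap:
    """Blend instance norm and layer norm, then scale and shift per channel."""
    rho = min(1.0, max(0.0, rho))  # keep the blend ratio in [0, 1]
    # Instance norm: statistics computed separately for each channel.
    inst = [_normalize(channel, eps) for channel in x]
    # Layer norm: statistics computed over all channels and positions.
    width = len(x[0])
    flat = _normalize([v for channel in x for v in channel], eps)
    layer = [flat[c * width:(c + 1) * width] for c in range(len(x))]
    return [
        [
            gamma[c] * (rho * i + (1.0 - rho) * l) + beta[c]
            for i, l in zip(inst[c], layer[c])
        ]
        for c in range(len(x))
    ]
```

With `rho = 1.0` the result is pure instance normalization, and with `rho = 0.0` it is pure layer normalization; values in between interpolate the two.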

Semantic Image Synthesis with Spatially-Adaptive Normalization

NVlabs/SPADE • CVPR 2019

Previous methods directly feed the semantic layout as input to the deep network, which is then processed through stacks of convolution, normalization, and nonlinearity layers.

High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs

NVIDIA/pix2pixHD • CVPR 2018

We present a new method for synthesizing high-resolution photo-realistic images from semantic label maps using conditional generative adversarial networks (conditional GANs).

Multimodal Unsupervised Image-to-Image Translation

nvlabs/MUNIT • ECCV 2018

To translate an image to another domain, we recombine its content code with a random style code sampled from the style space of the target domain.
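The recombination step described above can be sketched as follows. Everything here is a toy assumption rather than the MUNIT code: `encode_content` and `decode` are hypothetical stand-ins for learned networks, images are flat lists of floats, and the style code is drawn from a standard normal distribution.

```python
import random
from typing import List


def encode_content(image: List[float]) -> List[float]:
    """Hypothetical content encoder: mean-centred pixel values."""
    mean = sum(image) / len(image)
    return [p - mean for p in image]


def sample_style(dim: int, rng: random.Random) -> List[float]:
    """Draw a random style code from a standard normal distribution."""
    return [rng.gauss(0.0, 1.0) for _ in range(dim)]


def decode(content: List[float], style: List[float]) -> List[float]:
    """Hypothetical decoder: style[0] scales the content, style[1] shifts it."""
    scale, shift = style[0], style[1]
    return [scale * c + shift for c in content]


def translate(image: List[float], rng: random.Random) -> List[float]:
    """Keep the image's content code and pair it with a freshly sampled style."""
    content = encode_content(image)
    style = sample_style(2, rng)
    return decode(content, style)
```

Calling `translate` repeatedly on the same image with different random states yields different outputs that share one content code, which is the sense in which the translation is multimodal.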

StarGAN v2: Diverse Image Synthesis for Multiple Domains

clovaai/stargan-v2 • CVPR 2020

A good image-to-image translation model should learn a mapping between different visual domains while satisfying the following properties: 1) diversity of generated images and 2) scalability over multiple domains.

Everybody Dance Now

carolineec/EverybodyDanceNow • ICCV 2019

This paper presents a simple method for "do as I do" motion transfer: given a source video of a person dancing, we can transfer that performance to a novel (amateur) target after only a few minutes of the target subject performing standard moves.
