Image-to-Image Translation
492 papers with code • 37 benchmarks • 29 datasets
Image-to-Image Translation is a computer vision and machine learning task whose goal is to learn a mapping from an input image to an output image, so that the translated output can be used for applications such as style transfer, data augmentation, or image restoration.
(Image credit: Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks)
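For the unpaired setting referenced in the image credit (CycleGAN), the central ingredient is a cycle-consistency loss that ties two generators together. The sketch below is a minimal illustration of that term only, assuming two generators G (X→Y) and F (Y→X) passed in by the caller; the adversarial losses of the full objective are omitted.

```python
import torch.nn.functional as F_nn

def cycle_consistency_loss(G, F, real_x, real_y, lam=10.0):
    """Minimal sketch of the cycle-consistency term used in unpaired translation.

    G: generator mapping domain X -> Y, F: generator mapping domain Y -> X.
    The adversarial terms of the full CycleGAN objective are not included here.
    """
    fake_y = G(real_x)          # translate X -> Y
    rec_x = F(fake_y)           # translate back Y -> X
    fake_x = F(real_y)          # translate Y -> X
    rec_y = G(fake_x)           # translate back X -> Y
    # L1 reconstruction of both cycles, weighted by lambda
    return lam * (F_nn.l1_loss(rec_x, real_x) + F_nn.l1_loss(rec_y, real_y))
```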
Latest papers
Fine-grained Appearance Transfer with Diffusion Models
A pivotal aspect of our approach is the strategic use of the predicted $x_0$ space of diffusion models within the latent space of the diffusion process.
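In a standard DDPM-style formulation, the predicted $x_0$ can be recovered in closed form from the noisy latent and the network's noise estimate. The snippet below is a generic sketch of that relation (variable names such as alpha_bar_t are illustrative), not the paper's specific appearance-transfer procedure.

```python
import torch

def predict_x0(x_t, eps_pred, alpha_bar_t):
    """Closed-form estimate of the clean sample x_0 from a noisy latent x_t.

    x_t        : noisy latent at timestep t
    eps_pred   : noise predicted by the denoising network
    alpha_bar_t: cumulative product of (1 - beta_s) up to t (scalar or broadcastable tensor)
    """
    sqrt_ab = torch.sqrt(alpha_bar_t)
    sqrt_one_minus_ab = torch.sqrt(1.0 - alpha_bar_t)
    return (x_t - sqrt_one_minus_ab * eps_pred) / sqrt_ab
```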
A deep learning approach for marine snow synthesis and removal
Marine snow, floating particles that appear in underwater images, severely degrades visibility and the performance of both human and machine vision systems.
H-Packer: Holographic Rotationally Equivariant Convolutional Neural Network for Protein Side-Chain Packing
Accurately modeling protein 3D structure is essential for the design of functional proteins.
Optimal Transport-Guided Conditional Score-Based Diffusion Models
The conditional score-based diffusion model (SBDM) generates target data conditioned on paired data and has achieved great success in image translation.
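A common way to condition a noise-prediction network on a paired source image is to concatenate the condition with the noisy target along the channel dimension. The sketch below shows that generic pattern, assuming a UNet-style denoiser that accepts the widened input; it is not the optimal-transport-guided method of the paper.

```python
import torch

def conditional_eps(denoiser, x_t, cond_img, t):
    """Generic channel-concatenation conditioning for a paired-image diffusion model.

    denoiser : network taking a (2*C)-channel input and timestep t, returning C-channel noise
    x_t      : noisy target image, shape (B, C, H, W)
    cond_img : paired condition (source) image, shape (B, C, H, W)
    """
    net_in = torch.cat([x_t, cond_img], dim=1)   # stack condition and noisy target along channels
    return denoiser(net_in, t)                   # predicted noise for the target image
```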
TPSeNCE: Towards Artifact-Free Realistic Rain Generation for Deraining and Object Detection in Rain
We first introduce a Triangular Probability Similarity (TPS) constraint to guide the generated images toward clear and rainy images in the discriminator manifold, thereby minimizing artifacts and distortions during rain generation.
Adaptive Latent Diffusion Model for 3D Medical Image to Image Translation: Multi-modal Magnetic Resonance Imaging Study
Our model synthesized images successfully across different source-target modality scenarios and surpassed other models in quantitative evaluations on multi-modal brain magnetic resonance imaging datasets spanning four modalities and on the independent IXI dataset.
WAIT: Feature Warping for Animation to Illustration video Translation using GANs
Current state-of-the-art video-to-video translation models rely on having a video sequence or a single style image to stylize an input video.
VI-Diff: Unpaired Visible-Infrared Translation Diffusion Model for Single Modality Labeled Visible-Infrared Person Re-identification
In this paper, we propose VI-Diff, a diffusion model that effectively addresses the task of Visible-Infrared person image translation.
Image-to-Image Translation with Deep Reinforcement Learning
The key feature of the RL-I2IT framework is to decompose a monolithic learning process into small steps, using a lightweight model to progressively transform a source image into a target image.
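As a conceptual illustration of step-wise translation, a lightweight model can propose a small update at each step instead of producing the output in one shot. The loop below is only a sketch of that idea under a hypothetical step_model(current, t) interface; it does not reproduce the paper's reinforcement-learning (planner/actor) formulation.

```python
import torch

def progressive_translate(step_model, source, num_steps=10):
    """Illustrative step-wise translation: apply small residual updates over several steps.

    step_model : hypothetical lightweight network proposing an update given the current image and step index
    source     : source-domain image tensor, shape (B, C, H, W)
    """
    current = source.clone()
    for t in range(num_steps):
        delta = step_model(current, t)   # small update proposed at step t
        current = current + delta        # progressively move toward the target domain
    return current
```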
Masked Discriminators for Content-Consistent Unpaired Image-to-Image Translation
In this work, we show that masking the inputs of a global discriminator for both domains with a content-based mask is sufficient to reduce content inconsistencies significantly.
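Masking the inputs of a global discriminator with a shared, content-based mask can be expressed as a simple elementwise product applied identically in both domains. The sketch below is an illustrative reading of that idea, assuming the mask is supplied by the caller (its construction is a placeholder); it is not the paper's exact discriminator design.

```python
import torch

def masked_disc_logits(D, image, content_mask):
    """Score an image with a global discriminator D after content-based masking.

    image        : (B, C, H, W) image from either domain
    content_mask : (B, 1, H, W) mask in [0, 1] selecting content regions (placeholder input)
    """
    masked = image * content_mask      # the same masking is applied to images from both domains
    return D(masked)                   # discriminator logits computed on the masked input
```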