Diverse Image-to-Image Translation via Disentangled Representations

Image-to-image translation aims to learn the mapping between two visual domains. Many applications face two main challenges: 1) the lack of aligned training pairs and 2) the existence of multiple possible outputs for a single input image. In this work, we present an approach based on disentangled representation for producing diverse outputs without paired training images. To achieve diversity, we propose to embed images onto two spaces: a domain-invariant content space capturing shared information across domains and a domain-specific attribute space. At test time, our model takes the content features encoded from a given input together with attribute vectors sampled from the attribute space to produce diverse outputs. To handle unpaired training data, we introduce a novel cross-cycle consistency loss based on disentangled representations. Qualitative results show that our model can generate diverse and realistic images on a wide range of tasks without paired training data. For quantitative comparisons, we measure realism with a user study and diversity with a perceptual distance metric. We apply the proposed model to domain adaptation and show performance competitive with the state of the art on the MNIST-M and LineMod datasets.
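The idea described above can be illustrated with a short sketch. The PyTorch snippet below is a rough illustration, not the authors' implementation: it defines a content encoder, an attribute encoder, and a generator, and shows how diverse outputs follow from decoding a single content code with several attribute vectors sampled from a prior. All module names, layer widths, the attribute dimension, and the 64x64 resolution are assumptions made for this example.

```python
# Minimal sketch of diverse translation via disentangled content/attribute codes.
# Architectural details here are illustrative assumptions, not the paper's exact design.
import torch
import torch.nn as nn

class ContentEncoder(nn.Module):
    """Maps an image to a domain-invariant content feature map."""
    def __init__(self, in_ch=3, ch=64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(in_ch, ch, 7, 1, 3), nn.ReLU(inplace=True),
            nn.Conv2d(ch, ch * 2, 4, 2, 1), nn.ReLU(inplace=True),
            nn.Conv2d(ch * 2, ch * 4, 4, 2, 1), nn.ReLU(inplace=True),
        )

    def forward(self, x):
        return self.net(x)

class AttributeEncoder(nn.Module):
    """Maps an image to a low-dimensional domain-specific attribute vector."""
    def __init__(self, in_ch=3, attr_dim=8):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(in_ch, 64, 4, 2, 1), nn.ReLU(inplace=True),
            nn.Conv2d(64, 128, 4, 2, 1), nn.ReLU(inplace=True),
            nn.AdaptiveAvgPool2d(1),
        )
        self.fc = nn.Linear(128, attr_dim)

    def forward(self, x):
        return self.fc(self.conv(x).flatten(1))

class Generator(nn.Module):
    """Decodes a content feature map conditioned on an attribute vector."""
    def __init__(self, content_ch=256, attr_dim=8, out_ch=3):
        super().__init__()
        self.net = nn.Sequential(
            nn.ConvTranspose2d(content_ch + attr_dim, 128, 4, 2, 1), nn.ReLU(inplace=True),
            nn.ConvTranspose2d(128, 64, 4, 2, 1), nn.ReLU(inplace=True),
            nn.Conv2d(64, out_ch, 7, 1, 3), nn.Tanh(),
        )

    def forward(self, content, attr):
        # Broadcast the attribute vector spatially and concatenate it with the content map.
        b, _, h, w = content.shape
        attr_map = attr.view(b, -1, 1, 1).expand(b, attr.size(1), h, w)
        return self.net(torch.cat([content, attr_map], dim=1))

# Diverse outputs at test time: encode the content of one input image,
# then decode it with several attribute vectors sampled from a standard normal prior.
E_c, E_a, G_Y = ContentEncoder(), AttributeEncoder(), Generator()
x = torch.randn(1, 3, 64, 64)                 # an image from domain X
content = E_c(x)
outputs = [G_Y(content, torch.randn(1, 8)) for _ in range(5)]  # 5 diverse translations

# The attribute encoder covers the encoding side of the disentanglement:
# attributes can also be taken from an example image y in domain Y.
y = torch.randn(1, 3, 64, 64)
out_from_example = G_Y(content, E_a(y))
```

In this sketch the cross-cycle consistency loss would be built from the same pieces: swap the attribute codes of two unpaired images, translate, then swap back and require reconstruction of the originals.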

| Task | Dataset | Model | Metric | Value | Global Rank |
|---|---|---|---|---|---|
| Multimodal Unsupervised Image-To-Image Translation | AFHQ | DRIT | FID | 95.6 | #4 |
| Multimodal Unsupervised Image-To-Image Translation | CelebA-HQ | DRIT | FID | 52.1 | #4 |
| Synthetic-to-Real Translation | GTAV-to-Cityscapes Labels | Domain adaptation | mIoU | 43.2 | #60 |
