Unified Generative Adversarial Networks for Controllable Image-to-Image Translation

12 Dec 2019  ·  Hao Tang, Hong Liu, Nicu Sebe

We propose a unified Generative Adversarial Network (GAN) for controllable image-to-image translation, i.e., transferring an image from a source to a target domain guided by controllable structures. In addition to conditioning on a reference image, we show how the model can generate images conditioned on controllable structures, e.g., class labels, object keypoints, human skeletons, and scene semantic maps. The proposed model consists of a single generator and a discriminator that take a conditional image and the target controllable structure as input. In this way, the conditional image provides appearance information and the controllable structure provides structural information for generating the target result. Moreover, our model learns the image-to-image mapping through three novel losses: a color loss, a controllable-structure-guided cycle-consistency loss, and a controllable-structure-guided self-content-preserving loss. We also present the Fréchet ResNet Distance (FRD) to evaluate the quality of the generated images. Experiments on two challenging image translation tasks, hand gesture-to-gesture translation and cross-view image translation, show that our model generates convincing results and significantly outperforms state-of-the-art methods on both tasks. Furthermore, the proposed framework is a unified solution, so it can also be applied to other controllable-structure-guided image translation tasks, such as landmark-guided facial expression translation and keypoint-guided person image generation. To the best of our knowledge, we are the first to make a single GAN framework work on all such controllable-structure-guided image translation tasks. Code is available at https://github.com/Ha0Tang/GestureGAN.
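The proposed FRD metric is a Fréchet distance computed over image features (the paper uses a ResNet as the feature extractor). The feature-extraction step is omitted here, but the distance itself follows the standard Fréchet formula between two Gaussians fitted to real and generated feature sets; the sketch below is a minimal illustration of that computation, assuming the features have already been extracted as row vectors:

```python
import numpy as np
from scipy import linalg


def frechet_distance(feat_real, feat_fake):
    """Fréchet distance between two feature sets, each modelled as a
    multivariate Gaussian: ||mu1 - mu2||^2 + Tr(s1 + s2 - 2*sqrt(s1*s2)).

    feat_real, feat_fake: arrays of shape (num_samples, feature_dim),
    e.g. ResNet activations for real and generated images.
    """
    mu1, mu2 = feat_real.mean(axis=0), feat_fake.mean(axis=0)
    sigma1 = np.cov(feat_real, rowvar=False)
    sigma2 = np.cov(feat_fake, rowvar=False)
    diff = mu1 - mu2
    # Matrix square root of the covariance product; discard tiny
    # imaginary components introduced by numerical error.
    covmean, _ = linalg.sqrtm(sigma1 @ sigma2, disp=False)
    if np.iscomplexobj(covmean):
        covmean = covmean.real
    return float(diff @ diff + np.trace(sigma1 + sigma2 - 2.0 * covmean))
```

With identical feature sets the distance is (numerically) zero, and it grows as the generated-feature distribution drifts from the real one; lower is better, matching the FRD rows in the benchmark table below.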

| Task | Dataset | Model | Metric | Value | Global Rank |
|---|---|---|---|---|---|
| Cross-View Image-to-Image Translation | CVUSA | UniGAN | SSIM | 0.5366 | #1 |
| | | | KL | 2.6 | #1 |
| | | | PSNR | 22.8223 | #1 |
| | | | SD | 19.8276 | #1 |
| Cross-View Image-to-Image Translation | Dayton (256×256), aerial-to-ground | UniGAN | SSIM | 0.3357 | #6 |
| | | | KL | 5.17 | #1 |
| | | | PSNR | 22.0273 | #2 |
| | | | SD | 17.6542 | #2 |
| Cross-View Image-to-Image Translation | Dayton (64×64), aerial-to-ground | UniGAN | SSIM | 0.5064 | #3 |
| | | | KL | 2.16 | #1 |
| | | | LPIPS | 0.3817 | #1 |
| | | | PSNR | 23.3632 | #1 |
| | | | SD | 16.4788 | #1 |
| Cross-View Image-to-Image Translation | Dayton (64×64), ground-to-aerial | UniGAN | LPIPS | 0.4527 | #1 |
| Gesture-to-Gesture Translation | NTU Hand Digit | UniGAN | PSNR | 32.6574 | #1 |
| | | | IS | 2.3783 | #6 |
| | | | AMT | 29.3 | #1 |
| | | | FID | 6.7493 | #1 |
| | | | FRD | 1.7401 | #1 |
| Gesture-to-Gesture Translation | Senz3D | UniGAN | PSNR | 31.542 | #1 |
| | | | IS | 2.2159 | #6 |
| | | | AMT | 27.6 | #1 |
| | | | FID | 12.4465 | #1 |
| | | | FRD | 2.2104 | #1 |
