TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Image-to-Image Translation	ADE20K Labels-to-Photos	pix2pixHD	mIoU	20.3	# 9
Image-to-Image Translation	ADE20K Labels-to-Photos	pix2pixHD	Accuracy	69.2%	# 6
Image-to-Image Translation	ADE20K Labels-to-Photos	pix2pixHD	FID	81.8	# 14
Image-to-Image Translation	ADE20K-Outdoor Labels-to-Photos	pix2pixHD	mIoU	17.4	# 4
Image-to-Image Translation	ADE20K-Outdoor Labels-to-Photos	pix2pixHD	Accuracy	71.6%	# 3
Image-to-Image Translation	ADE20K-Outdoor Labels-to-Photos	pix2pixHD	FID	97.8	# 6
Image-to-Image Translation	Cityscapes Labels-to-Photo	pix2pixHD	Per-pixel Accuracy	81.4%	# 5
Image-to-Image Translation	Cityscapes Labels-to-Photo	pix2pixHD	mIoU	58.3	# 9
Image-to-Image Translation	Cityscapes Labels-to-Photo	pix2pixHD	FID	95	# 14
Sketch-to-Image Translation	COCO-Stuff	Pix2PixHD	FID	38.7	# 2
Sketch-to-Image Translation	COCO-Stuff	Pix2PixHD	FID-C	27.1	# 2
Image-to-Image Translation	COCO-Stuff Labels-to-Photos	pix2pixHD	mIoU	14.6	# 7
Image-to-Image Translation	COCO-Stuff Labels-to-Photos	pix2pixHD	Accuracy	45.8%	# 5
Image-to-Image Translation	COCO-Stuff Labels-to-Photos	pix2pixHD	FID	111.5	# 14
Fundus to Angiography Generation	Fundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic Patients	pix2pixHD	FID	42.8	# 7
Fundus to Angiography Generation	Fundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic Patients	pix2pixHD	Kernel Inception Distance	0.00258	# 6

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/high-resolution-image-synthesis-and-semantic/sketch-to-image-translation-on-coco-stuff)](https://paperswithcode.com/sota/sketch-to-image-translation-on-coco-stuff?p=high-resolution-image-synthesis-and-semantic)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/high-resolution-image-synthesis-and-semantic/image-to-image-translation-on-ade20k-outdoor)](https://paperswithcode.com/sota/image-to-image-translation-on-ade20k-outdoor?p=high-resolution-image-synthesis-and-semantic)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/high-resolution-image-synthesis-and-semantic/fundus-to-angiography-generation-on-fundus)](https://paperswithcode.com/sota/fundus-to-angiography-generation-on-fundus?p=high-resolution-image-synthesis-and-semantic)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/high-resolution-image-synthesis-and-semantic/image-to-image-translation-on-ade20k-labels)](https://paperswithcode.com/sota/image-to-image-translation-on-ade20k-labels?p=high-resolution-image-synthesis-and-semantic)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/high-resolution-image-synthesis-and-semantic/image-to-image-translation-on-cityscapes)](https://paperswithcode.com/sota/image-to-image-translation-on-cityscapes?p=high-resolution-image-synthesis-and-semantic)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/high-resolution-image-synthesis-and-semantic/image-to-image-translation-on-coco-stuff)](https://paperswithcode.com/sota/image-to-image-translation-on-coco-stuff?p=high-resolution-image-synthesis-and-semantic)`

High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs

CVPR 2018 · Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu, Andrew Tao, Jan Kautz, Bryan Catanzaro ·

We present a new method for synthesizing high-resolution photo-realistic images from semantic label maps using conditional generative adversarial networks (conditional GANs). Conditional GANs have enabled a variety of applications, but the results are often limited to low-resolution and still far from realistic. In this work, we generate 2048x1024 visually appealing results with a novel adversarial loss, as well as new multi-scale generator and discriminator architectures. Furthermore, we extend our framework to interactive visual manipulation with two additional features. First, we incorporate object instance segmentation information, which enables object manipulations such as removing/adding objects and changing the object category. Second, we propose a method to generate diverse results given the same input, allowing users to edit the object appearance interactively. Human opinion studies demonstrate that our method significantly outperforms existing methods, advancing both the quality and the resolution of deep image synthesis and editing.

PDF Abstract CVPR 2018 PDF CVPR 2018 Abstract

Code

Add Remove Mark official

NVIDIA/pix2pixHD official

6,524

mingyuliutw/UNIT

1,967

moabarar/nemar

162

UBC-Computer-Vision-Group/DwNet

ubc-vision/DwNet

See all 20 implementations

Tasks

Add Remove

Conditional Image Generation

Fundus to Angiography Generation

Image Generation

Image-to-Image Translation

Instance Segmentation

Object

Semantic Segmentation

Sketch-to-Image Translation

Vocal Bursts Intensity Prediction

Datasets

Cityscapes

ADE20K

CelebA-HQ

COCO-Stuff

Helen

Results from the Paper

Edit

Ranked #2 on Sketch-to-Image Translation on COCO-Stuff

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Image-to-Image Translation	ADE20K-Outdoor Labels-to-Photos	pix2pixHD	mIoU	17.4	# 4	Compare
			Accuracy	71.6%	# 3	Compare
			FID	97.8	# 6	Compare
Image-to-Image Translation	Cityscapes Labels-to-Photo	pix2pixHD	Per-pixel Accuracy	81.4%	# 5	Compare
			mIoU	58.3	# 9	Compare
			FID	95	# 14	Compare
Sketch-to-Image Translation	COCO-Stuff	Pix2PixHD	FID	38.7	# 2	Compare
Sketch-to-Image Translation	COCO-Stuff	Pix2PixHD	FID-C	27.1	# 2	Compare
Fundus to Angiography Generation	Fundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic Patients	pix2pixHD	FID	42.8	# 7	Compare
Fundus to Angiography Generation		pix2pixHD	Kernel Inception Distance	0.00258	# 6	Compare

Results from Other Papers

Task	Dataset	Model	Metric Name	Metric Value	Rank	Compare
Image-to-Image Translation	ADE20K Labels-to-Photos	pix2pixHD	mIoU	20.3	# 9	See all
			Accuracy	69.2%	# 6	See all
			FID	81.8	# 14	See all
Image-to-Image Translation	COCO-Stuff Labels-to-Photos	pix2pixHD	mIoU	14.6	# 7	See all
			Accuracy	45.8%	# 5	See all
			FID	111.5	# 14	See all

Methods

Add Remove

Batch Normalization • Concatenated Skip Connection • Convolution • Dropout • Leaky ReLU • PatchGAN • Pix2Pix • ReLU • Sigmoid Activation

Edit Social Preview

High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit