SESAME: Semantic Editing of Scenes by Adding, Manipulating or Erasing Objects

Recent advances in image generation have given rise to powerful tools for semantic image editing. However, existing approaches either operate only on a single image or require an abundance of additional information, and they cannot handle the complete set of editing operations, that is, the addition, manipulation, or removal of semantic concepts. To address these limitations, we propose SESAME, a novel generator-discriminator pair for Semantic Editing of Scenes by Adding, Manipulating, or Erasing objects. In our setup, the user provides the semantic labels of the areas to be edited, and the generator synthesizes the corresponding pixels. In contrast to previous methods, whose discriminators trivially concatenate the semantics and the image as input, the SESAME discriminator is composed of two input streams that independently process the image and its semantics, using the latter to manipulate the results of the former. We evaluate our model on a diverse set of datasets and report state-of-the-art performance on two tasks: (a) image manipulation and (b) image generation conditioned on semantic labels.
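The two-stream discriminator described above can be made concrete with a short sketch. The following is a minimal, hypothetical PyTorch rendering, not the authors' released implementation: the names (`SESAMEDiscriminator`, `img_stream`, `sem_stream`, `mod_heads`), the layer widths, and the choice of feature-wise affine modulation as the mechanism by which the semantics "manipulate" the image features are all illustrative assumptions consistent with the abstract.

```python
# Minimal sketch of a two-stream discriminator: the image and the semantic
# label map are processed by separate convolutional streams, and the
# semantics stream modulates the image-stream features instead of being
# concatenated at the input. All names and sizes are illustrative.
import torch
import torch.nn as nn

class SESAMEDiscriminator(nn.Module):
    def __init__(self, img_channels=3, sem_channels=35, width=64):
        super().__init__()
        # Image stream: plain strided convolutions over RGB pixels.
        self.img_stream = nn.ModuleList([
            nn.Conv2d(img_channels, width, 4, stride=2, padding=1),
            nn.Conv2d(width, width * 2, 4, stride=2, padding=1),
        ])
        # Semantics stream: mirrors the image stream's resolutions.
        self.sem_stream = nn.ModuleList([
            nn.Conv2d(sem_channels, width, 4, stride=2, padding=1),
            nn.Conv2d(width, width * 2, 4, stride=2, padding=1),
        ])
        # Per-scale heads that turn semantic features into a scale (gamma)
        # and shift (beta) applied to the image features.
        self.mod_heads = nn.ModuleList([
            nn.Conv2d(width, width * 2, 3, padding=1),
            nn.Conv2d(width * 2, width * 4, 3, padding=1),
        ])
        # Patch-level real/fake scores.
        self.head = nn.Conv2d(width * 2, 1, 3, padding=1)

    def forward(self, image, semantics):
        x, s = image, semantics
        for img_conv, sem_conv, mod in zip(self.img_stream,
                                           self.sem_stream,
                                           self.mod_heads):
            x = torch.relu(img_conv(x))
            s = torch.relu(sem_conv(s))
            # Semantics modulate the image features at each scale.
            gamma, beta = mod(s).chunk(2, dim=1)
            x = x * (1 + gamma) + beta
        return self.head(x)

# Usage: the semantics input would be a one-hot label map in practice;
# shapes here are placeholders.
disc = SESAMEDiscriminator()
scores = disc(torch.randn(1, 3, 256, 256), torch.randn(1, 35, 256, 256))
```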

ECCV 2020
| Task | Dataset | Model | Metric | Value | Global Rank |
|------|---------|-------|--------|-------|-------------|
| Image-to-Image Translation | ADE20K Labels-to-Photos | SPADE + SESAME | mIoU | 49 | #3 |
| Image-to-Image Translation | ADE20K Labels-to-Photos | SPADE + SESAME | Accuracy | 85.5% | #1 |
| Image-to-Image Translation | ADE20K Labels-to-Photos | SPADE + SESAME | FID | 31.9 | #6 |
| Image-to-Image Translation | Cityscapes Labels-to-Photo | SPADE + SESAME | Per-pixel Accuracy | 82.5% | #1 |
| Image-to-Image Translation | Cityscapes Labels-to-Photo | SPADE + SESAME | mIoU | 66 | #4 |
| Image-to-Image Translation | Cityscapes Labels-to-Photo | SPADE + SESAME | FID | 54.2 | #8 |
