Learning Canonical Representations for Scene Graph to Image Generation

Generating realistic images of complex visual scenes becomes challenging when one wishes to control the structure of the generated images. Previous approaches showed that scenes with few entities can be controlled using scene graphs, but this approach struggles as the complexity of the graph (the number of objects and edges) increases. In this work, we show that one limitation of current methods is their inability to capture semantic equivalence in graphs. We present a novel model that addresses these issues by learning canonical graph representations from the data, resulting in improved image generation for complex visual scenes. Our model demonstrates improved empirical performance on large scene graphs, robustness to noise in the input scene graph, and generalization on semantically equivalent graphs. Finally, we show improved performance of the model on three different benchmarks: Visual Genome, COCO, and CLEVR.
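To illustrate the notion of semantic equivalence that the abstract refers to, the sketch below shows one simple, purely rule-based way a scene graph could be mapped to a canonical form: converse relations (e.g. "right of" vs. "left of") are rewritten in a single chosen direction and transitive relations are closed, so that graphs describing the same scene coincide. This is only a minimal illustration of the idea; the relation names, the CANONICAL_RELATION / TRANSITIVE tables, and the canonicalize function are hypothetical, and in the paper the relevant relation properties are learned from data rather than hard-coded.

```python
# Hypothetical sketch of scene-graph canonicalization; relation names,
# converse pairs, and the transitive-closure step are illustrative
# assumptions, not the authors' implementation.

# A scene graph is represented as a set of (subject, relation, object) triples.

# Assumed converse pairs: each relation maps to a canonical direction,
# with a flag indicating whether subject and object must be swapped.
CANONICAL_RELATION = {
    "right of": ("left of", True),   # "A right of B" becomes "B left of A"
    "left of": ("left of", False),
    "below": ("above", True),
    "above": ("above", False),
}

# Relations assumed to be transitive for the closure step.
TRANSITIVE = {"left of", "above"}


def canonicalize(graph):
    """Map a scene graph to a canonical form so that semantically
    equivalent graphs (e.g. using 'left of' vs. 'right of') coincide."""
    canonical = set()
    for subj, rel, obj in graph:
        rel_c, flip = CANONICAL_RELATION[rel]
        canonical.add((obj, rel_c, subj) if flip else (subj, rel_c, obj))

    # Transitive closure: keep adding implied edges until a fixed point.
    changed = True
    while changed:
        changed = False
        for a, r1, b in list(canonical):
            if r1 not in TRANSITIVE:
                continue
            for b2, r2, c in list(canonical):
                if b2 == b and r2 == r1 and (a, r1, c) not in canonical:
                    canonical.add((a, r1, c))
                    changed = True
    return canonical


# Two semantically equivalent graphs map to the same canonical set.
g1 = {("cup", "right of", "plate"), ("plate", "right of", "fork")}
g2 = {("plate", "left of", "cup"), ("fork", "left of", "plate")}
assert canonicalize(g1) == canonicalize(g2)
```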

ECCV 2020
Task: Layout-to-Image Generation    Model: AttSPADE

Dataset                  Metric            Value   Global Rank
COCO-Stuff 256x256       Inception Score   15.6    #2
COCO-Stuff 256x256       FID               54.7    #5
COCO-Stuff 256x256       LPIPS             0.44    #1
Visual Genome 256x256    Inception Score   11.0    #3
Visual Genome 256x256    FID               36.4    #3
Visual Genome 256x256    LPIPS             0.51    #1
