TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Layout-to-Image Generation	COCO-Stuff 64x64	Layout2Im	FID	38.14	# 3
Layout-to-Image Generation	COCO-Stuff 64x64	Layout2Im	Inception Score	9.1	# 4
Layout-to-Image Generation	Visual Genome 64x64	Layout2Im	FID	31.25	# 2
Layout-to-Image Generation	Visual Genome 64x64	Layout2Im	Inception Score	8.1	# 3

Image Generation from Layout

CVPR 2019 · Bo Zhao, Lili Meng, Weidong Yin, Leonid Sigal

Despite significant recent progress on generative models, controlled generation of images depicting multiple and complex object layouts is still a difficult problem. Among the core challenges are the diversity of appearance a given object may possess and, as a result, the exponential set of images consistent with a specified layout. To address these challenges, we propose a novel approach for layout-based image generation; we call it Layout2Im. Given the coarse spatial layout (bounding boxes + object categories), our model can generate a set of realistic images which have the correct objects in the desired locations. The representation of each object is disentangled into a specified/certain part (category) and an unspecified/uncertain part (appearance). The category is encoded using a word embedding and the appearance is distilled into a low-dimensional vector sampled from a normal distribution. Individual object representations are composed together using a convolutional LSTM to obtain an encoding of the complete layout, which is then decoded to an image. Several loss terms are introduced to encourage accurate and diverse generation. The proposed Layout2Im model significantly outperforms the previous state of the art, boosting the best reported Inception Score by 24.66% and 28.57% on the very challenging COCO-Stuff and Visual Genome datasets, respectively. Extensive experiments also demonstrate our method's ability to generate complex and diverse images with multiple objects.
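The disentangled object representation described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the dimensions, the function names, the random stand-in for a learned word embedding, and the step that copies each object's vector into the cells covered by its bounding box are all assumptions made for illustration. The convolutional LSTM that fuses the per-object maps and the image decoder are omitted.

```python
import random

EMBED_DIM = 4   # hypothetical size of the category word embedding
APPEAR_DIM = 3  # hypothetical size of the appearance latent
GRID = 8        # hypothetical spatial size of each per-object feature map


def make_category_embeddings(categories, dim, rng):
    """Stand-in for a learned word-embedding table: one random vector per category."""
    return {c: [rng.gauss(0.0, 1.0) for _ in range(dim)] for c in categories}


def object_latent(category, embeddings, rng):
    """Concatenate the certain part (category) with a freshly sampled appearance code."""
    appearance = [rng.gauss(0.0, 1.0) for _ in range(APPEAR_DIM)]
    return embeddings[category] + appearance


def place_on_grid(latent, box, grid=GRID):
    """Copy the latent into the cells covered by a normalized (x0, y0, x1, y1) box.

    Cells outside the box stay at zero. This placement step is an assumption.
    """
    x0, y0, x1, y1 = box
    zero = [0.0] * len(latent)
    fmap = [[list(zero) for _ in range(grid)] for _ in range(grid)]
    for row in range(int(y0 * grid), int(y1 * grid)):
        for col in range(int(x0 * grid), int(x1 * grid)):
            fmap[row][col] = list(latent)
    return fmap


# A toy layout: (category, normalized bounding box).
rng = random.Random(0)
embeddings = make_category_embeddings(["person", "tree"], EMBED_DIM, rng)
layout = [("person", (0.25, 0.25, 0.75, 1.0)), ("tree", (0.0, 0.0, 0.5, 0.5))]
feature_maps = [place_on_grid(object_latent(c, embeddings, rng), b) for c, b in layout]
```

Because the appearance code is resampled on every call to `object_latent`, the same layout yields different per-object maps each time, which mirrors the abstract's point that one layout is consistent with many images.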

Code

No code implementations yet.

Tasks

Layout-to-Image Generation

Object

Datasets

Visual Genome

COCO-Stuff

Results from the Paper

Ranked #2 on Layout-to-Image Generation on Visual Genome 64x64
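The relative gains quoted in the abstract (24.66% and 28.57%) can be related to the Inception Scores listed for Layout2Im (9.1 on COCO-Stuff 64x64, 8.1 on Visual Genome 64x64). The sketch below back-computes the previous best score that each percentage would imply. It assumes the percentages are relative improvements over that previous best and that they refer to these same 64x64 scores; the function name is hypothetical.

```python
def implied_baseline(new_score, relative_gain_pct):
    """Previous score implied by new_score = baseline * (1 + gain / 100)."""
    return new_score / (1.0 + relative_gain_pct / 100.0)


# (Layout2Im Inception Score, relative gain in percent from the abstract)
reported = {
    "COCO-Stuff 64x64": (9.1, 24.66),
    "Visual Genome 64x64": (8.1, 28.57),
}

baselines = {
    dataset: round(implied_baseline(score, gain), 2)
    for dataset, (score, gain) in reported.items()
}

for dataset, baseline in baselines.items():
    print(f"{dataset}: implied previous best Inception Score ~ {baseline}")
```

Under these assumptions the implied previous best scores come out at about 7.3 for COCO-Stuff and 6.3 for Visual Genome.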

Methods

LSTM • Sigmoid Activation • Tanh Activation
