TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Layout-to-Image Generation	Visual Genome 128x128	PLGAN	FID	20.62	# 2
Layout-to-Image Generation	Visual Genome 128x128	PLGAN	Inception Score	10.6	# 4
Layout-to-Image Generation	Visual Genome 256x256	PLGAN	Inception Score	13.2	# 2
Layout-to-Image Generation	Visual Genome 256x256	PLGAN	FID	28.06	# 2


Interactive Image Synthesis with Panoptic Layout Generation

CVPR 2022 · Bo Wang, Tao Wu, Minfeng Zhu, Peng Du

Interactive image synthesis from user-guided input is a challenging task when users wish to control the scene structure of a generated image with ease. Although remarkable progress has been made on layout-based image synthesis, existing methods require high-precision inputs to produce realistic images in interactive scenarios; such inputs often need several rounds of adjustment and are unfriendly to novice users. When the placement of bounding boxes is subject to perturbation, layout-based models suffer from "missing regions" in the constructed semantic layouts and hence undesirable artifacts in the generated images. In this work, we propose Panoptic Layout Generative Adversarial Networks (PLGAN) to address this challenge. PLGAN employs panoptic theory, which distinguishes object categories between "stuff" with amorphous boundaries and "things" with well-defined shapes, so that stuff and instance layouts are constructed through separate branches and later fused into panoptic layouts. In particular, the stuff layouts can take amorphous shapes and fill the missing regions left by the instance layouts. We experimentally compare PLGAN with state-of-the-art layout-based models on the COCO-Stuff, Visual Genome, and Landscape datasets. The advantages of PLGAN are not only demonstrated visually but also verified quantitatively in terms of inception score, Fréchet inception distance, classification accuracy score, and coverage.
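The "missing regions" idea in the abstract can be illustrated with a toy sketch. This is not PLGAN's implementation: the paper constructs and fuses layouts with learned network branches, whereas the snippet below uses hard integer label maps. The function names (`instance_layout`, `fuse_panoptic`, `count_missing`), the label values, and the 4x6 scene are all hypothetical, chosen only to show how box-derived instance layouts can leave pixels uncovered and how a stuff layout can fill them.

```python
def instance_layout(boxes, height, width):
    """Rasterize (label, x0, y0, x1, y1) boxes into a label map; 0 means uncovered."""
    layout = [[0] * width for _ in range(height)]
    for label, x0, y0, x1, y1 in boxes:
        for y in range(y0, y1):
            for x in range(x0, x1):
                layout[y][x] = label
    return layout


def fuse_panoptic(instance_map, stuff_map):
    """Keep instance labels where present; fall back to stuff labels elsewhere."""
    return [
        [inst if inst > 0 else stuff for inst, stuff in zip(inst_row, stuff_row)]
        for inst_row, stuff_row in zip(instance_map, stuff_map)
    ]


def count_missing(layout):
    """Count pixels that no category covers."""
    return sum(cell == 0 for row in layout for cell in row)


# Hypothetical 4x6 scene: label 1 is a "thing", label 2 is a "stuff" class.
things = instance_layout([(1, 1, 1, 3, 3)], height=4, width=6)
stuff = [[2] * 6 for _ in range(4)]  # stand-in for a predicted stuff layout
panoptic = fuse_panoptic(things, stuff)

missing_before = count_missing(things)
missing_after = count_missing(panoptic)
print(missing_before, missing_after)
```

In this toy setting the instance layout alone leaves every pixel outside the box unlabeled, while the fused layout assigns a category to each pixel. A hard per-pixel fallback is used here only for clarity; it says nothing about how the paper's branches are trained or combined.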


Code


wb-finalking/PLGAN official

Tasks


Image Generation

Layout-to-Image Generation

Datasets

Visual Genome

Results from the Paper


Ranked #2 on Layout-to-Image Generation on Visual Genome 128x128

