Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation

13 Apr 2023  ·  Jaemin Cho, Linjie Li, Zhengyuan Yang, Zhe Gan, Lijuan Wang, Mohit Bansal ·

Spatial control is a core capability in controllable image generation. Advancements in layout-guided image generation have shown promising results on in-distribution (ID) datasets with similar spatial configurations. However, it is unclear how these models perform when facing out-of-distribution (OOD) samples with arbitrary, unseen layouts. In this paper, we propose LayoutBench, a diagnostic benchmark for layout-guided image generation that examines four categories of spatial control skills: number, position, size, and shape. We benchmark two recent representative layout-guided image generation methods and observe that the good ID layout control may not generalize well to arbitrary layouts in the wild (e.g., objects at the boundary). Next, we propose IterInpaint, a new baseline that generates foreground and background regions in a step-by-step manner via inpainting, demonstrating stronger generalizability than existing models on OOD layouts in LayoutBench. We perform quantitative and qualitative evaluation and fine-grained analysis on the four LayoutBench skills to pinpoint the weaknesses of existing models. Lastly, we show comprehensive ablation studies on IterInpaint, including training task ratio, crop&paste vs. repaint, and generation order. Project website: https://layoutbench.github.io

PDF Abstract

Datasets


Introduced in the Paper:

LayoutBench

Used in the Paper:

CLEVR

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Benchmark
Layout-to-Image Generation LayoutBench IterInpaint AP 36.5 # 1
Layout-to-Image Generation LayoutBench ReCo AP 7.6 # 3
Layout-to-Image Generation LayoutBench LDM AP 9.9 # 2

Methods