ChatPainter: Improving Text to Image Generation using Dialogue

22 Feb 2018  ·  Shikhar Sharma, Dendi Suhubdy, Vincent Michalski, Samira Ebrahimi Kahou, Yoshua Bengio

Synthesizing realistic images from text descriptions on a dataset like Microsoft Common Objects in Context (MS COCO), where each image can contain several objects, is a challenging task. Prior work has used text captions to generate images. However, captions might not be informative enough to capture the entire image, and they may be insufficient for the model to understand which objects in the image correspond to which words in the caption. We show that adding a dialogue that further describes the scene leads to a significant improvement in the inception score and in the quality of generated images on the MS COCO dataset.

Results from the Paper


Ranked #25 on Text-to-Image Generation on MS COCO (Inception score metric)

Task                      Dataset  Model        Metric name      Metric value  Global rank
Text-to-Image Generation  MS COCO  ChatPainter  Inception score  9.74          #24
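
For readers unfamiliar with the metric, the inception score is commonly defined as exp(E_x[KL(p(y|x) || p(y))]), where p(y|x) is a classifier's class distribution for a generated image x and p(y) is the average of those distributions over all generated images. The following is a minimal sketch of that formula only, not the evaluation code behind the number above. It assumes the class probabilities have already been computed (reported scores typically take them from a pretrained Inception network, which is omitted here), and the function name `inception_score` is hypothetical.

```python
import math


def inception_score(probs):
    """Compute exp(mean KL(p(y|x) || p(y))) from per-image class probabilities.

    probs: list of lists, one row per image, each row summing to 1.
    Assumption: rows come from some image classifier; that step is not shown.
    """
    n = len(probs)
    num_classes = len(probs[0])

    # Marginal class distribution p(y): the mean of p(y|x) over all images.
    marginal = [sum(row[k] for row in probs) / n for k in range(num_classes)]

    # Average the KL divergence between each row and the marginal.
    mean_kl = 0.0
    for row in probs:
        kl = sum(
            p * (math.log(p) - math.log(m))
            for p, m in zip(row, marginal)
            if p > 0.0  # terms with p == 0 contribute nothing to the KL sum
        )
        mean_kl += kl / n

    return math.exp(mean_kl)


if __name__ == "__main__":
    # Toy input: confident predictions spread evenly over two classes.
    print(inception_score([[1.0, 0.0], [0.0, 1.0]]))
```

Under this definition the score is 1 when every image yields the same class distribution, and it approaches the number of classes when predictions are both confident and evenly spread across classes, which is why a higher score is read as better quality and diversity.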
