iCap: Interactive Image Captioning with Predictive Text

In this paper we study a brand new topic of interactive image captioning with human in the loop. Different from automated image captioning where a given test image is the sole input in the inference stage, we have access to both the test image and a sequence of (incomplete) user-input sentences in the interactive scenario... (read more)

Results in Papers With Code
(↓ scroll down to see all results)