2 code implementations • 1 Nov 2022 • David Nukrai, Ron Mokady, Amir Globerson
We consider the task of image captioning using only the CLIP model and additional text data at training time, with no additional captioned images.
Ranked #1 on Image Captioning on MSCOCO
Language Modelling • Semi Supervised Learning for Image Captioning