no code implementations • 18 Aug 2020 • Gouthaman KV, Athira Nambiar, Kancheti Sai Srinivas, Anurag Mittal
Humans perform such a correlation with a strong linguistic understanding of the visual world.
Image Captioning Visual Question Answering (VQA)