no code implementations • 28 Aug 2021 • Gouthaman KV, Anurag Mittal
This paper shows that the sequence model architecture used in the question-encoder has a significant role in the generalizability of VQA models.
no code implementations • 18 Aug 2020 • Gouthaman KV, Athira Nambiar, Kancheti Sai Srinivas, Anurag Mittal
Humans perform such a correlation with a strong linguistic understanding of the visual world.
no code implementations • ECCV 2020 • Gouthaman KV, Anurag Mittal
We demonstrate the effect of VGQE on three recent VQA models and achieve state-of-the-art results on the bias-sensitive split of the VQAv2 dataset; VQA-CPv2.