Search Results for author: Gouthaman KV

Found 3 papers, 0 papers with code

On the Significance of Question Encoder Sequence Model in the Out-of-Distribution Performance in Visual Question Answering

no code implementations • 28 Aug 2021 • Gouthaman KV, Anurag Mittal

This paper shows that the sequence model architecture used in the question-encoder has a significant role in the generalizability of VQA models.

Graph Attention Question Answering +1

Paper
Add Code

Linguistically-aware Attention for Reducing the Semantic-Gap in Vision-Language Tasks

no code implementations • 18 Aug 2020 • Gouthaman KV, Athira Nambiar, Kancheti Sai Srinivas, Anurag Mittal

Humans perform such a correlation with a strong linguistic understanding of the visual world.

Image Captioning Visual Question Answering (VQA)

Paper
Add Code

Reducing Language Biases in Visual Question Answering with Visually-Grounded Question Encoder

no code implementations • ECCV 2020 • Gouthaman KV, Anurag Mittal

We demonstrate the effect of VGQE on three recent VQA models and achieve state-of-the-art results on the bias-sensitive split of the VQAv2 dataset; VQA-CPv2.

Question Answering Visual Grounding +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.