Visual Question Answering

141 papers with code · Computer Vision

Latest papers without code

On the Value of Out-of-Distribution Testing: An Example of Goodhart's Law

19 May 2020

Out-of-distribution (OOD) testing is increasingly popular for evaluating a machine learning system's ability to generalize beyond the biases of a training set.

MODEL SELECTION · QUESTION ANSWERING · VISUAL QUESTION ANSWERING

Visual Relationship Detection using Scene Graphs: A Survey

16 May 2020

In this paper, we present a detailed survey of the various techniques for scene graph generation, their efficacy in representing visual relationships, and how they have been used to solve various downstream tasks.

GRAPH GENERATION · IMAGE GENERATION · IMAGE RETRIEVAL · OBJECT RECOGNITION · QUESTION ANSWERING · SCENE GRAPH GENERATION · VISUAL QUESTION ANSWERING · VISUAL RELATIONSHIP DETECTION

Cross-Modality Relevance for Reasoning on Language and Vision

12 May 2020

This work deals with the challenge of learning and reasoning over language and vision data for the related downstream tasks such as visual question answering (VQA) and natural language for visual reasoning (NLVR).

QUESTION ANSWERING · VISUAL QUESTION ANSWERING · VISUAL REASONING

COBRA: Contrastive Bi-Modal Representation Algorithm

7 May 2020

In this paper, we present COBRA, a novel framework that aims to train two modalities (image and text) jointly, inspired by the Contrastive Predictive Coding (CPC) and Noise Contrastive Estimation (NCE) paradigms, which preserve both inter- and intra-class relationships.

CROSS-MODAL RETRIEVAL · IMAGE CAPTIONING · QUESTION ANSWERING · VISUAL QUESTION ANSWERING

Visual Question Answering with Prior Class Semantics

4 May 2020

We present a novel mechanism to embed prior knowledge in a model for visual question answering.

QUESTION ANSWERING · VISUAL QUESTION ANSWERING · WORD EMBEDDINGS

Diverse Visuo-Lingustic Question Answering (DVLQA) Challenge

1 May 2020

Existing question answering datasets mostly contain homogeneous contexts, based on either textual or visual information alone.

QUESTION ANSWERING · READING COMPREHENSION · VISUAL QUESTION ANSWERING

Dynamic Language Binding in Relational Visual Reasoning

30 Apr 2020

We present Language-binding Object Graph Network, the first neural reasoning method with dynamic relational structures across both visual and textual domains with applications in visual question answering.

QUESTION ANSWERING · VISUAL QUESTION ANSWERING · VISUAL REASONING

Pragmatic Issue-Sensitive Image Captioning

29 Apr 2020

Image captioning systems have recently improved dramatically, but they still tend to produce captions that are insensitive to the communicative goals that captions should meet.

IMAGE CAPTIONING · QUESTION ANSWERING · VISUAL QUESTION ANSWERING

A Novel Attention-based Aggregation Function to Combine Vision and Language

27 Apr 2020

The joint understanding of vision and language has recently been gaining a lot of attention in both the Computer Vision and Natural Language Processing communities, with the emergence of tasks such as image captioning, image-text matching, and visual question answering.

IMAGE CAPTIONING · QUESTION ANSWERING · TEXT MATCHING · VISUAL QUESTION ANSWERING

Deep Multimodal Neural Architecture Search

25 Apr 2020

Most existing works focus on a single task and design neural architectures manually, which are highly task-specific and hard to generalize to different tasks.

NEURAL ARCHITECTURE SEARCH · QUESTION ANSWERING · TEXT MATCHING · VISUAL QUESTION ANSWERING