Browse > Reasoning > Visual Reasoning > Visual Commonsense Reasoning

Visual Commonsense Reasoning

5 papers with code · Reasoning
Subtask of Visual Reasoning

Leaderboards

No evaluation results yet. Help compare methods by submit evaluation metrics.

Greatest papers with code

From Recognition to Cognition: Visual Commonsense Reasoning

CVPR 2019 rowanz/r2c

While this task is easy for humans, it is tremendously difficult for today's vision systems, requiring higher-order cognition and commonsense reasoning about the world.

VISUAL COMMONSENSE REASONING

ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks

NeurIPS 2019 jiasenlu/vilbert_beta

We present ViLBERT (short for Vision-and-Language BERT), a model for learning task-agnostic joint representations of image content and natural language.

IMAGE RETRIEVAL QUESTION ANSWERING VISUAL COMMONSENSE REASONING VISUAL QUESTION ANSWERING

VL-BERT: Pre-training of Generic Visual-Linguistic Representations

22 Aug 2019jackroos/VL-BERT

We introduce a new pre-trainable generic representation for visual-linguistic tasks, called Visual-Linguistic BERT (VL-BERT for short).

LANGUAGE MODELLING QUESTION ANSWERING VISUAL COMMONSENSE REASONING VISUAL QUESTION ANSWERING

Heterogeneous Graph Learning for Visual Commonsense Reasoning

NeurIPS 2019 yuweijiang/HGL-pytorch

Our HGL consists of a primal vision-to-answer heterogeneous graph (VAHG) module and a dual question-to-answer heterogeneous graph (QAHG) module to interactively refine reasoning paths for semantic agreement.

VISUAL COMMONSENSE REASONING

Connective Cognition Network for Directional Visual Commonsense Reasoning

NeurIPS 2019 AmingWu/CCN

Inspired by this idea, towards VCR, we propose a connective cognition network (CCN) to dynamically reorganize the visual neuron connectivity that is contextualized by the meaning of questions and answers.

VISUAL COMMONSENSE REASONING