Browse > Reasoning > Visual Reasoning > Visual Commonsense Reasoning

Visual Commonsense Reasoning

2 papers with code · Reasoning
Subtask of Visual Reasoning

State-of-the-art leaderboards

No evaluation results yet. Help compare methods by submit evaluation metrics.

Greatest papers with code

From Recognition to Cognition: Visual Commonsense Reasoning

CVPR 2019 rowanz/r2c

While this task is easy for humans, it is tremendously difficult for today's vision systems, requiring higher-order cognition and commonsense reasoning about the world.

VISUAL COMMONSENSE REASONING

ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks

6 Aug 2019jiasenlu/vilbert_beta

We present ViLBERT (short for Vision-and-Language BERT), a model for learning task-agnostic joint representations of image content and natural language.

IMAGE RETRIEVAL QUESTION ANSWERING VISUAL COMMONSENSE REASONING VISUAL QUESTION ANSWERING