Visual Reasoning

20 papers with code · Reasoning

Latest papers with code

LXMERT: Learning Cross-Modality Encoder Representations from Transformers

20 Aug 2019 airsplay/lxmert

In LXMERT, we build a large-scale Transformer model that consists of three encoders: an object relationship encoder, a language encoder, and a cross-modality encoder.

LANGUAGE MODELLING · QUESTION ANSWERING · VISUAL QUESTION ANSWERING · VISUAL REASONING

160
20 Aug 2019

VisualBERT: A Simple and Performant Baseline for Vision and Language

9 Aug 2019 uclanlp/visualbert

We propose VisualBERT, a simple and flexible framework for modeling a broad range of vision-and-language tasks.

LANGUAGE MODELLING · VISUAL QUESTION ANSWERING · VISUAL REASONING

92
09 Aug 2019

Learning by Abstraction: The Neural State Machine

9 Jul 2019 ceyzaguirre4/NSM

We introduce the Neural State Machine, seeking to bridge the gap between the neural and symbolic views of AI and integrate their complementary strengths for the task of visual reasoning.

VISUAL QUESTION ANSWERING · VISUAL REASONING

13
09 Jul 2019

Learning Dynamics of Attention: Human Prior for Interpretable Machine Reasoning

28 May 2019 kakao/DAFT

Without relevant human priors, neural networks may learn uninterpretable features.

VISUAL REASONING

16
28 May 2019

GQA: A New Dataset for Real-World Visual Reasoning and Compositional Question Answering

CVPR 2019 kakao/DAFT

We introduce GQA, a new dataset for real-world visual reasoning and compositional question answering, seeking to address key shortcomings of previous VQA datasets.

QUESTION ANSWERING · VISUAL QUESTION ANSWERING · VISUAL REASONING

16
25 Feb 2019

When Causal Intervention Meets Adversarial Examples and Image Masking for Deep Neural Networks

9 Feb 2019 jjaacckkyy63/Causal-Intervention-AE-wAdvImg

To study the intervention effects on pixel-level features for causal reasoning, we introduce pixel-wise masking and adversarial perturbation.

CAUSAL INFERENCE · VISUAL REASONING

1
09 Feb 2019
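
As a rough illustration of what a pixel-level masking intervention measures, the sketch below masks one pixel at a time and records how much a score changes. It is a toy under stated assumptions, not the paper's method: `toy_score` is a hypothetical stand-in for a network output (a plain weighted sum), and `mask_pixel`, `masking_effects`, and the example values are made up for illustration.

```python
def mask_pixel(image, row, col, fill=0.0):
    """Return a copy of `image` with one pixel set to `fill` (the intervention)."""
    masked = [list(r) for r in image]
    masked[row][col] = fill
    return masked


def toy_score(image, weights):
    """Hypothetical stand-in for a network output: a weighted sum of pixels."""
    return sum(
        pixel * weight
        for image_row, weight_row in zip(image, weights)
        for pixel, weight in zip(image_row, weight_row)
    )


def masking_effects(image, weights):
    """Effect of masking each pixel: original score minus score after masking."""
    base = toy_score(image, weights)
    return [
        [base - toy_score(mask_pixel(image, r, c), weights)
         for c in range(len(image[0]))]
        for r in range(len(image))
    ]


# Assumed 2x2 example image and weights, chosen only for illustration.
image = [[1.0, 2.0], [3.0, 4.0]]
weights = [[0.5, 0.0], [-1.0, 0.25]]
effects = masking_effects(image, weights)
```

A pixel whose weight is zero has no effect when masked, while pixels with large weights dominate the effect map; a real study would replace `toy_score` with the trained network under test.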

Explainable and Explicit Visual Reasoning over Scene Graphs

CVPR 2019 shijx12/XNM-Net

We aim to dismantle the prevalent black-box neural architectures used in complex visual reasoning tasks into the proposed eXplainable and eXplicit Neural Modules (XNMs), which advance beyond existing neural module networks by using scene graphs (objects as nodes and pairwise relationships as edges) for explainable and explicit reasoning with structured knowledge.

VISUAL REASONING

56
05 Dec 2018
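
The scene-graph representation described in this entry (objects as nodes, pairwise relationships as edges) can be sketched as a small data structure. This is a minimal illustration only, not the XNM-Net implementation: the `SceneGraph` class, its `filter` and `relate` methods, and the example objects are all hypothetical names chosen for the sketch.

```python
from dataclasses import dataclass, field


@dataclass
class SceneGraph:
    """Toy scene graph: objects are nodes, pairwise relationships are directed edges."""
    nodes: dict = field(default_factory=dict)  # object id -> set of attribute strings
    edges: list = field(default_factory=list)  # (subject id, relation, object id)

    def add_object(self, obj_id, *attributes):
        self.nodes[obj_id] = set(attributes)

    def add_relation(self, subj, relation, obj):
        if subj not in self.nodes or obj not in self.nodes:
            raise KeyError("both endpoints must be added as objects first")
        self.edges.append((subj, relation, obj))

    def filter(self, attribute):
        """Return ids of objects that carry the given attribute, in insertion order."""
        return [o for o, attrs in self.nodes.items() if attribute in attrs]

    def relate(self, subj_ids, relation):
        """Follow `relation` edges from the given subjects to their targets."""
        return [o for s, r, o in self.edges if s in subj_ids and r == relation]


# Assumed example scene, made up for illustration.
graph = SceneGraph()
graph.add_object("cube1", "cube", "red")
graph.add_object("sphere1", "sphere", "blue")
graph.add_object("cube2", "cube", "blue")
graph.add_relation("cube1", "left_of", "sphere1")
graph.add_relation("sphere1", "left_of", "cube2")

# "What is the red thing to the left of?" answered in two explicit steps.
red_things = graph.filter("red")
answer = graph.relate(red_things, "left_of")
```

Because each step returns a plain list of object ids, the intermediate result of every reasoning step can be inspected, which is the kind of explicitness the abstract refers to.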

A Corpus for Reasoning About Natural Language Grounded in Photographs

ACL 2019 vortexJCH/nlvr

We crowdsource the data using sets of visually rich images and a compare-and-contrast task to elicit linguistically diverse language.

VISUAL REASONING

1
01 Nov 2018

Mapping Natural Language Commands to Web Elements

EMNLP 2018 stanfordnlp/phrasenode

The web provides a rich, open-domain environment with textual, structural, and spatial properties.

RELATIONAL REASONING · VISUAL REASONING

19
28 Aug 2018