Browse > Reasoning > Visual Reasoning

Visual Reasoning

29 papers with code · Reasoning

Leaderboards

Latest papers with code

Cross-Modality Relevance for Reasoning on Language and Vision

12 May 2020HLR/Cross_Modality_Relevance

This work deals with the challenge of learning and reasoning over language and vision data for the related downstream tasks such as visual question answering (VQA) and natural language for visual reasoning (NLVR).

QUESTION ANSWERING VISUAL QUESTION ANSWERING VISUAL REASONING

3
12 May 2020

Differentiable Adaptive Computation Time for Visual Reasoning

27 Apr 2020ceyzaguirre4/DACT-MAC

This paper presents a novel attention-based algorithm for achieving adaptive computation called DACT, which, unlike existing ones, is end-to-end differentiable.

VISUAL REASONING

5
27 Apr 2020

Smart Home Appliances: Chat with Your Fridge

19 Dec 2019gudovskiy/fridge-demo

Current home appliances are capable to execute a limited number of voice commands such as turning devices on or off, adjusting music volume or light conditions.

VISUAL REASONING

1
19 Dec 2019

Are Disentangled Representations Helpful for Abstract Visual Reasoning?

NeurIPS 2019 google-research/disentanglement_lib

A disentangled representation encodes information about the salient factors of variation in the data independently.

VISUAL REASONING

863
01 Dec 2019

Learning by Abstraction: The Neural State Machine

NeurIPS 2019 stanfordnlp/mac-network

We introduce the Neural State Machine, seeking to bridge the gap between the neural and symbolic views of AI and integrate their complementary strengths for the task of visual reasoning.

VISUAL QUESTION ANSWERING VISUAL REASONING

413
01 Dec 2019

Learning Dynamics of Attention: Human Prior for Interpretable Machine Reasoning

NeurIPS 2019 kakao/DAFT

Without relevant human priors, neural networks may learn uninterpretable features.

VISUAL REASONING

25
01 Dec 2019

Temporal Reasoning via Audio Question Answering

21 Nov 2019facebookresearch/daqa

In this paper, we use the task of Audio Question Answering (AQA) to study the temporal reasoning abilities of machine learning models.

AUDIO QUESTION ANSWERING QUESTION ANSWERING READING COMPREHENSION VISUAL QUESTION ANSWERING VISUAL REASONING

10
21 Nov 2019

LXMERT: Learning Cross-Modality Encoder Representations from Transformers

IJCNLP 2019 airsplay/lxmert

In LXMERT, we build a large-scale Transformer model that consists of three encoders: an object relationship encoder, a language encoder, and a cross-modality encoder.

LANGUAGE MODELLING QUESTION ANSWERING VISUAL QUESTION ANSWERING VISUAL REASONING

375
20 Aug 2019

VisualBERT: A Simple and Performant Baseline for Vision and Language

9 Aug 2019uclanlp/visualbert

We propose VisualBERT, a simple and flexible framework for modeling a broad range of vision-and-language tasks.

LANGUAGE MODELLING VISUAL QUESTION ANSWERING VISUAL REASONING

174
09 Aug 2019

Learning by Abstraction: The Neural State Machine

NeurIPS 2019 ceyzaguirre4/NSM

We introduce the Neural State Machine, seeking to bridge the gap between the neural and symbolic views of AI and integrate their complementary strengths for the task of visual reasoning.

VISUAL QUESTION ANSWERING VISUAL REASONING

33
09 Jul 2019