Browse > Reasoning > Visual Reasoning

Visual Reasoning

28 papers with code ยท Reasoning

Leaderboards

Latest papers without code

Retrieving and Highlighting Action with Spatiotemporal Reference

19 May 2020

In this paper, we present a framework that jointly retrieves and spatiotemporally highlights actions in videos by enhancing current deep cross-modal retrieval methods.

CROSS-MODAL RETRIEVAL VISUAL REASONING

Cross-Modality Relevance for Reasoning on Language and Vision

12 May 2020

This work deals with the challenge of learning and reasoning over language and vision data for the related downstream tasks such as visual question answering (VQA) and natural language for visual reasoning (NLVR).

QUESTION ANSWERING VISUAL QUESTION ANSWERING VISUAL REASONING

Dynamic Language Binding in Relational Visual Reasoning

30 Apr 2020

We present Language-binding Object Graph Network, the first neural reasoning method with dynamic relational structures across both visual and textual domains with applications in visual question answering.

QUESTION ANSWERING VISUAL QUESTION ANSWERING VISUAL REASONING

Machine Number Sense: A Dataset of Visual Arithmetic Problems for Abstract and Relational Reasoning

25 Apr 2020

To endow such a crucial cognitive ability to machine intelligence, we propose a dataset, Machine Number Sense (MNS), consisting of visual arithmetic problems automatically generated using a grammar model--And-Or Graph (AOG).

RELATIONAL REASONING VISUAL REASONING

SHOP-VRB: A Visual Reasoning Benchmark for Object Perception

6 Apr 2020

In this paper we present an approach and a benchmark for visual reasoning in robotics applications, in particular small object grasping and manipulation.

VISUAL REASONING

Pixel-BERT: Aligning Image Pixels with Text by Deep Multi-Modal Transformers

2 Apr 2020

We aim to build a more accurate and thorough connection between image pixels and language semantics directly from image and sentence pairs instead of using region-based image features as the most recent vision and language tasks.

LANGUAGE MODELLING QUESTION ANSWERING TEXT MATCHING VISUAL QUESTION ANSWERING VISUAL REASONING

TextCaps: a Dataset for Image Captioning with Reading Comprehension

24 Mar 2020

Image descriptions can help visually impaired people to quickly understand the image content.

IMAGE CAPTIONING OPTICAL CHARACTER RECOGNITION READING COMPREHENSION VISUAL REASONING

Learning Rope Manipulation Policies Using Dense Object Descriptors Trained on Synthetic Depth Data

3 Mar 2020

We address these challenges using interpretable deep visual representations for rope, extending recent work on dense object descriptors for robot manipulation.

VISUAL REASONING

Cops-Ref: A new Dataset and Task on Compositional Referring Expression Comprehension

1 Mar 2020

To bridge the gap, we propose a new dataset for visual reasoning in context of referring expression comprehension with two main features.

VISUAL REASONING

Hierarchical Rule Induction Network for Abstract Visual Reasoning

17 Feb 2020

Abstract reasoning refers to the ability to analyze information, discover rules at an intangible level, and solve problems in innovative ways.

RELATION EXTRACTION VISUAL REASONING