Vision-Language Navigation

5 papers with code · Computer Vision

Vision-language navigation (VLN) is the task of navigating an embodied agent to carry out natural language instructions inside real 3D environments.

Latest papers with code

Tactical Rewind: Self-Correction via Backtracking in Vision-and-Language Navigation

CVPR 2019 Kelym/FAST

We present the Frontier Aware Search with backTracking (FAST) Navigator, a general framework for action decoding that achieves state-of-the-art results on the Room-to-Room (R2R) Vision-and-Language navigation challenge of Anderson et al.

VISION-LANGUAGE NAVIGATION

31
06 Mar 2019

The Regretful Navigation Agent for Vision-and-Language Navigation

CVPR 2019 (Oral) chihyaoma/regretful-agent

As deep learning continues to make progress for challenging perception tasks, there is increased interest in combining vision, language, and decision-making.

DECISION MAKING · VISION-LANGUAGE NAVIGATION · VISUAL NAVIGATION

80
05 Mar 2019

The Regretful Agent: Heuristic-Aided Navigation through Progress Estimation

CVPR 2019 chihyaoma/regretful-agent

As deep learning continues to make progress for challenging perception tasks, there is increased interest in combining vision, language, and decision-making.

DECISION MAKING · VISION-LANGUAGE NAVIGATION · VISUAL NAVIGATION

80
05 Mar 2019

Self-Monitoring Navigation Agent via Auxiliary Progress Estimation

ICLR 2019 chihyaoma/selfmonitoring-agent

The Vision-and-Language Navigation (VLN) task entails an agent following navigational instructions in photo-realistic unknown environments.

NATURAL LANGUAGE VISUAL GROUNDING · VISION-LANGUAGE NAVIGATION · VISUAL NAVIGATION

77
10 Jan 2019

Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation

CVPR 2019 extreme-assistant/cvpr2019

Vision-language navigation (VLN) is the task of navigating an embodied agent to carry out natural language instructions inside real 3D environments.

IMITATION LEARNING · VISION-LANGUAGE NAVIGATION

3,509
25 Nov 2018