TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	REMOVE
Vision and Language Navigation	Touchdown Dataset	RConcat	Task Completion (TC)	10.7	# 10
Vision and Language Navigation	Touchdown Dataset	Gated Attention (GA)	Task Completion (TC)	5.5	# 11

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/touchdown-natural-language-navigation-and/vision-and-language-navigation-on-touchdown)](https://paperswithcode.com/sota/vision-and-language-navigation-on-touchdown?p=touchdown-natural-language-navigation-and)`

Touchdown: Natural Language Navigation and Spatial Reasoning in Visual Street Environments

CVPR 2019 · Howard Chen, Alane Suhr, Dipendra Misra, Noah Snavely, Yoav Artzi ·

We study the problem of jointly reasoning about language and vision through a navigation and spatial reasoning task. We introduce the Touchdown task and dataset, where an agent must first follow navigation instructions in a real-life visual urban environment, and then identify a location described in natural language to find a hidden object at the goal position. The data contains 9,326 examples of English instructions and spatial descriptions paired with demonstrations. Empirical analysis shows the data presents an open challenge to existing methods, and qualitative linguistic analysis shows that the data displays richer use of spatial reasoning compared to related resources.

PDF Abstract CVPR 2019 PDF CVPR 2019 Abstract

Code

Add Remove Mark official

lil-lab/touchdown official

lil-lab/ciff

clic-lab/ciff

VegB/VLN-Transformer

Tasks

Add Remove

Position

Vision and Language Navigation

Datasets

Introduced in the Paper:

Touchdown Dataset

Used in the Paper:

Visual Question Answering

ReferItGame Google Refexp

Talk the Walk

Results from the Paper

Add Remove

Ranked #10 on Vision and Language Navigation on Touchdown Dataset

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Vision and Language Navigation	Touchdown Dataset	RConcat	Task Completion (TC)	10.7	# 10	Compare
Vision and Language Navigation	Touchdown Dataset	Gated Attention (GA)	Task Completion (TC)	5.5	# 11	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Touchdown: Natural Language Navigation and Spatial Reasoning in Visual Street Environments

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove