TAPEX: Table Pre-training via Learning a Neural SQL Executor

Recent progress in language model pre-training has achieved great success by leveraging large-scale unstructured textual data. However, applying pre-training to structured tabular data remains a challenge due to the absence of large-scale, high-quality tabular corpora. In this paper, we propose TAPEX to show that table pre-training can be achieved by learning a neural SQL executor over a synthetic corpus, which is obtained by automatically synthesizing executable SQL queries and their execution outputs. TAPEX addresses the data scarcity challenge by guiding the language model to mimic a SQL executor on this diverse, large-scale, and high-quality synthetic corpus. We evaluate TAPEX on four benchmark datasets. Experimental results demonstrate that TAPEX outperforms previous table pre-training approaches by a large margin and achieves new state-of-the-art results on all of them, improving the weakly-supervised WikiSQL denotation accuracy to 89.5% (+2.3%), the WikiTableQuestions denotation accuracy to 57.5% (+4.8%), the SQA denotation accuracy to 74.5% (+3.5%), and the TabFact accuracy to 84.2% (+3.2%). To our knowledge, this is the first work to exploit table pre-training via synthetic executable programs and to achieve new state-of-the-art results on various downstream tasks. Our code can be found at https://github.com/microsoft/Table-Pretraining.
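Concretely, each synthetic pre-training example pairs an executable SQL query and a flattened table with the query's execution result, and the model is trained to generate that result. Below is a minimal Python sketch of this idea; the toy table, the query templates, and the exact flattening separators are illustrative assumptions, not the paper's released pipeline.

```python
import sqlite3

# A toy table standing in for the web tables used in the paper (hypothetical data).
header = ["city", "country", "population"]
rows = [
    ("Paris", "France", 2148000),
    ("Lyon", "France", 513000),
    ("Berlin", "Germany", 3645000),
]

# Load the table into an in-memory SQLite database so queries are executable.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t (city TEXT, country TEXT, population INTEGER)")
conn.executemany("INSERT INTO t VALUES (?, ?, ?)", rows)

# A few synthesized SQL queries. The paper samples queries from templates
# instantiated over each table; these particular templates are illustrative only.
synthetic_queries = [
    "SELECT city FROM t WHERE country = 'France'",
    "SELECT COUNT(*) FROM t WHERE population > 1000000",
    "SELECT MAX(population) FROM t",
]

def flatten_table(header, rows):
    """Linearize the table into a token sequence, in the spirit of TAPEX's
    flattening scheme (the exact separators here are an approximation)."""
    cols = " | ".join(header)
    body = " ".join(
        f"row {i} : " + " | ".join(str(v) for v in r) for i, r in enumerate(rows, 1)
    )
    return f"col : {cols} {body}"

# Build (input, output) pre-training pairs: the seq2seq model reads the SQL
# query concatenated with the flattened table and learns to emit the
# execution result, i.e. it mimics a SQL executor.
for sql in synthetic_queries:
    result = conn.execute(sql).fetchall()
    answer = ", ".join(str(v) for row in result for v in row)
    source = f"{sql} {flatten_table(header, rows)}"
    print(f"INPUT : {source}\nTARGET: {answer}\n")
```

At fine-tuning time, the SQL query in the input is simply replaced by a natural-language question (or, for TabFact, a statement to verify), so the downstream tasks reuse the same flattened-table encoding.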

ICLR 2022 · PDF · Abstract

Results from the Paper

 Ranked #1 on Semantic Parsing on WikiSQL (Denotation accuracy (test) metric)

| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|------|---------|-------|-------------|--------------|-------------|
| Semantic Parsing | SQA | TAPEX-Large | Denotation Accuracy | 74.5 | #1 |
| Table-based Fact Verification | TabFact | TAPEX-Large | Test | 84.2 | #5 |
| Table-based Fact Verification | TabFact | TAPEX-Large | Val | 84.6 | #2 |
| Semantic Parsing | WikiSQL | TAPEX-Large (weak supervision) | Denotation Accuracy (Test) | 89.5 | #1 |
| Semantic Parsing | WikiTableQuestions | TAPEX-Large | Accuracy (Dev) | 57.0 | #6 |
| Semantic Parsing | WikiTableQuestions | TAPEX-Large | Accuracy (Test) | 57.5 | #6 |