ReasTAP: Injecting Table Reasoning Skills During Pre-training via Synthetic Reasoning Examples

22 Oct 2022  ·  Yilun Zhao, Linyong Nan, Zhenting Qi, Rui Zhang, Dragomir Radev ·

Reasoning over tabular data requires both table structure understanding and a broad set of table reasoning skills. Current models with table-specific architectures and pre-training methods perform well on understanding table structures, but they still struggle with tasks that require various table reasoning skills. In this work, we develop ReasTAP to show that high-level table reasoning skills can be injected into models during pre-training without a complex table-specific architecture design. We define 7 table reasoning skills, such as numerical operation, temporal comparison, and conjunction. Each reasoning skill is associated with one example generator, which synthesizes questions over semi-structured tables according to the sampled templates. We model the table pre-training task as a sequence generation task and pre-train ReasTAP to generate precise answers to the synthetic examples. ReasTAP is evaluated on four benchmarks covering three downstream tasks including: 1) WikiSQL and WTQ for Table Question Answering; 2) TabFact for Table Fact Verification; and 3) LogicNLG for Faithful Table-to-Text Generation. Experimental results demonstrate that ReasTAP achieves new state-of-the-art performance on all benchmarks and delivers a significant improvement on low-resource setting. Our code is publicly available at https://github.com/Yale-LILY/ReasTAP.

PDF Abstract

Results from the Paper


Ranked #3 on Semantic Parsing on WikiSQL (Denotation accuracy (test) metric)

     Get a GitHub badge
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Table-based Fact Verification TabFact ReasTAP-Large Test 84.9 # 5
Val 84.6 # 2
Semantic Parsing WikiSQL ReasTAP-Large (weak supervision) Denotation accuracy (test) 89.2 # 3
Semantic Parsing WikiTableQuestions ReasTAP-Large Accuracy (Dev) 59.7 # 5
Accuracy (Test) 58.7 # 8

Methods


No methods listed for this paper. Add relevant methods here