TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Table-based Fact Verification	TabFact	Dater	Test	93.0	# 1
Semantic Parsing	WikiTableQuestions	Dater	Accuracy (Dev)	64.8	# 2
Semantic Parsing	WikiTableQuestions	Dater	Accuracy (Test)	65.9	# 4

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/large-language-models-are-versatile/table-based-fact-verification-on-tabfact)](https://paperswithcode.com/sota/table-based-fact-verification-on-tabfact?p=large-language-models-are-versatile)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/large-language-models-are-versatile/semantic-parsing-on-wikitablequestions)](https://paperswithcode.com/sota/semantic-parsing-on-wikitablequestions?p=large-language-models-are-versatile)`

Large Language Models are Versatile Decomposers: Decompose Evidence and Questions for Table-based Reasoning

31 Jan 2023 · Yunhu Ye, Binyuan Hui, Min Yang, Binhua Li, Fei Huang, Yongbin Li

Table-based reasoning has shown remarkable progress in combining deep models with discrete reasoning, which requires reasoning over both free-form natural language (NL) questions and structured tabular data. However, previous table-based reasoning solutions usually suffer from significant performance degradation on huge evidence (tables). In addition, most existing methods struggle to reason over complex questions since the required information is scattered in different places. To alleviate the above challenges, we exploit large language models (LLMs) as decomposers for effective table-based reasoning, which (i) decompose huge evidence (a huge table) into sub-evidence (a small table) to mitigate the interference of useless information for table reasoning; and (ii) decompose complex questions into simpler sub-questions for text reasoning. Specifically, we first use the LLMs to break down the evidence (tables) involved in the current question, retaining the relevant evidence and excluding the remaining irrelevant evidence from the huge table. In addition, we propose a "parsing-execution-filling" strategy to alleviate the hallucination dilemma of the chain of thought by decoupling logic and numerical computation in each step. Extensive experiments show that our method can effectively leverage decomposed evidence and questions and outperforms strong baselines on the TabFact, WikiTableQuestions, and FeTaQA datasets. Notably, our model outperforms human performance for the first time on the TabFact dataset.
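The two decomposition steps and the "parsing-execution-filling" idea from the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the toy table, the `select_sub_evidence`, `execute`, and `fill` helpers, the `{x}` placeholder, and the hard-coded `parsed` dictionary (which stands in for what an LLM might return) are all assumptions made for illustration. The point it tries to show is that the numerical computation is handed to a SQL engine rather than produced by the language model.

```python
import sqlite3

# Toy table; all names and values are invented for illustration.
ROWS = [
    ("alice", "red", 12),
    ("bob", "blue", 7),
    ("carol", "red", 9),
    ("dave", "green", 4),
]


def select_sub_evidence(rows, keep_teams):
    """Stand-in for evidence decomposition: keep only the relevant rows.

    In the paper an LLM chooses the sub-table; here the choice is hard-coded.
    """
    return [row for row in rows if row[1] in keep_teams]


def execute(sql, rows):
    """Execution step: run the parsed SQL on the sub-table with a real engine."""
    con = sqlite3.connect(":memory:")
    con.execute("CREATE TABLE t (player TEXT, team TEXT, points INTEGER)")
    con.executemany("INSERT INTO t VALUES (?, ?, ?)", rows)
    value = con.execute(sql).fetchone()[0]
    con.close()
    return value


def fill(template, value):
    """Filling step: put the executed result back into the sub-question slot."""
    return template.replace("{x}", str(value))


# Hypothetical output of the parsing step: a sub-question with a slot,
# plus the SQL query that computes the value for that slot.
parsed = {
    "template": "The red team scored {x} points in total.",
    "sql": "SELECT SUM(points) FROM t WHERE team = 'red'",
}

sub_table = select_sub_evidence(ROWS, {"red"})
answer = fill(parsed["template"], execute(parsed["sql"], sub_table))
print(answer)  # The red team scored 21 points in total.
```

In this sketch the model would only be responsible for the logic (which rows matter, what query to run), while the sum itself comes from SQLite, which is one way to read "decoupling logic and numerical computation".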

Code

alibabaresearch/damo-convai official

977

Tasks

Hallucination

Semantic Parsing

Table-based Fact Verification

Datasets

TabFact

WikiTableQuestions

Results from the Paper

Ranked #1 on Table-based Fact Verification on TabFact
