Improving Generalization in Language Model-Based Text-to-SQL Semantic Parsing: Two Simple Semantic Boundary-Based Techniques

27 May 2023  ·  Daking Rai, Bailin Wang, Yilun Zhou, Ziyu Yao

Compositional and domain generalization present significant challenges in semantic parsing, even for state-of-the-art semantic parsers based on pre-trained language models (LMs). In this study, we empirically investigate improving an LM's generalization in semantic parsing with two simple techniques: at the token level, we introduce a token preprocessing method to preserve the semantic boundaries of tokens produced by LM tokenizers; at the sequence level, we propose to use special tokens to mark the boundaries of components aligned between input and output. Our experimental results on two text-to-SQL semantic parsing datasets show that our token preprocessing, although simple, can substantially improve the LM performance on both types of generalization, and our component boundary marking method is particularly helpful for compositional generalization.
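To make the two techniques concrete, below is a minimal Python sketch of how boundary-preserving token preprocessing and component boundary marking could look for a text-to-SQL example. The function names, the underscore/digit splitting rules, and the <c0>/</c0>-style special tokens are illustrative assumptions for this sketch, not the paper's exact implementation.

```python
# Illustrative sketch only; the paper's actual rules and special tokens may differ.
import re

def preprocess_tokens(sql: str) -> str:
    """Token-level preprocessing (illustrative): insert spaces around
    underscores and letter-digit boundaries so a subword tokenizer
    (e.g., T5's SentencePiece) splits schema items such as 'pet_age'
    at their semantic boundaries rather than into arbitrary subwords."""
    sql = sql.replace("_", " _ ")                    # 'pet_age' -> 'pet _ age'
    sql = re.sub(r"(?<=[a-zA-Z])(?=\d)", " ", sql)   # 'col1'    -> 'col 1'
    return re.sub(r"\s+", " ", sql).strip()

def mark_component_boundaries(question: str, sql: str, aligned_spans):
    """Sequence-level marking (illustrative): wrap aligned input/output
    components in paired special tokens so the model sees which parts of
    the question correspond to which parts of the SQL."""
    for i, (q_span, sql_span) in enumerate(aligned_spans):
        open_tok, close_tok = f"<c{i}>", f"</c{i}>"
        question = question.replace(q_span, f"{open_tok} {q_span} {close_tok}")
        sql = sql.replace(sql_span, f"{open_tok} {sql_span} {close_tok}")
    return question, sql

if __name__ == "__main__":
    print(preprocess_tokens("SELECT pet_age FROM pets WHERE pet_id = 1"))
    q, s = mark_component_boundaries(
        "How old is the dog named Max?",
        "SELECT pet_age FROM pets WHERE pet_name = 'Max'",
        [("How old", "SELECT pet_age"), ("named Max", "pet_name = 'Max'")],
    )
    print(q)
    print(s)
```

In this sketch, the preprocessing step only rewrites the model's input/output strings; the alignments passed to mark_component_boundaries are assumed to be given (e.g., from annotations or a heuristic aligner).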


Datasets

Spider
Results from the Paper


Task         Dataset   Model                              Metric                        Value   Global Rank
Text-To-SQL  Spider    T5-3B+NatSQL+Token Preprocessing   Exact Match Accuracy (Dev)    69.4    #8
Text-To-SQL  Spider    T5-3B+NatSQL+Token Preprocessing   Execution Accuracy (Dev)      73.7    #6
Text-To-SQL  Spider    T5-3B+NatSQL+Token Preprocessing   Execution Accuracy (Test)     78      #6

Methods


No methods listed for this paper.