TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Text-To-SQL	spider	Graphix-3B+PICARD	Exact Match Accuracy (Dev)	77.1	# 3
Text-To-SQL	spider	Graphix-3B+PICARD	Execution Accuracy (Dev)	81.0	# 5
Text-To-SQL	spider	Graphix-3B+PICARD	Execution Accuracy (Test)	77.6	# 7
Semantic Parsing	spider	Graphix-3B + PICARD	Accuracy	74.0	# 4

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/graphix-t5-mixing-pre-trained-transformers/semantic-parsing-on-spider)](https://paperswithcode.com/sota/semantic-parsing-on-spider?p=graphix-t5-mixing-pre-trained-transformers)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/graphix-t5-mixing-pre-trained-transformers/text-to-sql-on-spider)](https://paperswithcode.com/sota/text-to-sql-on-spider?p=graphix-t5-mixing-pre-trained-transformers)`

Graphix-T5: Mixing Pre-Trained Transformers with Graph-Aware Layers for Text-to-SQL Parsing

18 Jan 2023 · Jinyang Li, Binyuan Hui, Reynold Cheng, Bowen Qin, Chenhao Ma, Nan Huo, Fei Huang, Wenyu Du, Luo Si, Yongbin Li ·

The task of text-to-SQL parsing, which aims at converting natural language questions into executable SQL queries, has garnered increasing attention in recent years, as it can assist end users in efficiently extracting vital information from databases without the need for technical background. One of the major challenges in text-to-SQL parsing is domain generalization, i.e., how to generalize well to unseen databases. Recently, the pre-trained text-to-text transformer model, namely T5, though not specialized for text-to-SQL parsing, has achieved state-of-the-art performance on standard benchmarks targeting domain generalization. In this work, we explore ways to further augment the pre-trained T5 model with specialized components for text-to-SQL parsing. Such components are expected to introduce structural inductive bias into text-to-SQL parsers thus improving model's capacity on (potentially multi-hop) reasoning, which is critical for generating structure-rich SQLs. To this end, we propose a new architecture GRAPHIX-T5, a mixed model with the standard pre-trained transformer model augmented by some specially-designed graph-aware layers. Extensive experiments and analysis demonstrate the effectiveness of GRAPHIX-T5 across four text-to-SQL benchmarks: SPIDER, SYN, REALISTIC and DK. GRAPHIX-T5 surpass all other T5-based parsers with a significant margin, achieving new state-of-the-art performance. Notably, GRAPHIX-T5-large reach performance superior to the original T5-large by 5.7% on exact match (EM) accuracy and 6.6% on execution accuracy (EX). This even outperforms the T5-3B by 1.2% on EM and 1.5% on EX.

PDF Abstract

Code

Add Remove Mark official

AlibabaResearch/DAMO-ConvAI

958

Tasks

Add Remove

Domain Generalization

Inductive Bias

Semantic Parsing

SQL Parsing

Text-To-SQL

Datasets

SPIDER

Spider Spider-Realistic

Results from the Paper

Edit

Ranked #4 on Semantic Parsing on spider

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Text-To-SQL	spider	Graphix-3B+PICARD	Exact Match Accuracy (Dev)	77.1	# 3	Compare
			Execution Accuracy (Dev)	81.0	# 5	Compare
			Execution Accuracy (Test)	77.6	# 7	Compare
Semantic Parsing	spider	Graphix-3B + PICARD	Accuracy	74.0	# 4	Compare

Methods

Add Remove

Adafactor • Attention Dropout • BPE • Dense Connections • Dropout • GELU • GLU • Inverse Square Root Schedule • Layer Normalization • Linear Layer • Multi-Head Attention • Residual Connection • Scaled Dot-Product Attention • SentencePiece • Softmax • T5

Edit Social Preview

Graphix-T5: Mixing Pre-Trained Transformers with Graph-Aware Layers for Text-to-SQL Parsing

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove