TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Task-Oriented Dialogue Systems	KVRET	T5-3b(UnifiedSKG)	Entity F1	70.07	# 1
Table-based Fact Verification	TabFact	T5-3b(UnifiedSKG)	Test	83.68	# 7
Table-based Fact Verification	TabFact	T5-3b(UnifiedSKG)	Val	83.97	# 4
Semantic Parsing	WikiTableQuestions	T5-3b(UnifiedSKG)	Accuracy (Dev)	50.65	# 8
Semantic Parsing	WikiTableQuestions	T5-3b(UnifiedSKG)	Accuracy (Test)	49.29	# 11

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/unifiedskg-unifying-and-multi-tasking/task-oriented-dialogue-systems-on-kvret)](https://paperswithcode.com/sota/task-oriented-dialogue-systems-on-kvret?p=unifiedskg-unifying-and-multi-tasking)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/unifiedskg-unifying-and-multi-tasking/table-based-fact-verification-on-tabfact)](https://paperswithcode.com/sota/table-based-fact-verification-on-tabfact?p=unifiedskg-unifying-and-multi-tasking)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/unifiedskg-unifying-and-multi-tasking/semantic-parsing-on-wikitablequestions)](https://paperswithcode.com/sota/semantic-parsing-on-wikitablequestions?p=unifiedskg-unifying-and-multi-tasking)`

UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models

16 Jan 2022 · Tianbao Xie, Chen Henry Wu, Peng Shi, Ruiqi Zhong, Torsten Scholak, Michihiro Yasunaga, Chien-Sheng Wu, Ming Zhong, Pengcheng Yin, Sida I. Wang, Victor Zhong, Bailin Wang, Chengzu Li, Connor Boyle, Ansong Ni, Ziyu Yao, Dragomir Radev, Caiming Xiong, Lingpeng Kong, Rui Zhang, Noah A. Smith, Luke Zettlemoyer, Tao Yu ·

Structured knowledge grounding (SKG) leverages structured knowledge to complete user requests, such as semantic parsing over databases and question answering over knowledge bases. Since the inputs and outputs of SKG tasks are heterogeneous, they have been studied separately by different communities, which limits systematic and compatible research on SKG. In this paper, we overcome this limitation by proposing the UnifiedSKG framework, which unifies 21 SKG tasks into a text-to-text format, aiming to promote systematic SKG research, instead of being exclusive to a single task, domain, or dataset. We use UnifiedSKG to benchmark T5 with different sizes and show that T5, with simple modifications when necessary, achieves state-of-the-art performance on almost all of the 21 tasks. We further demonstrate that multi-task prefix-tuning improves the performance on most tasks, largely improving the overall performance. UnifiedSKG also facilitates the investigation of zero-shot and few-shot learning, and we show that T0, GPT-3, and Codex struggle in zero-shot and few-shot learning for SKG. We also use UnifiedSKG to conduct a series of controlled experiments on structured knowledge encoding variants across SKG tasks. UnifiedSKG is easily extensible to more tasks, and it is open-sourced at https://github.com/hkunlp/unifiedskg.

PDF Abstract

Code

Add Remove Mark official

hkunlp/unifiedskg official

↳ Quickstart in

Colab

530

Tasks

Add Remove

Few-Shot Learning

Question Answering

Semantic Parsing

Table-based Fact Verification

Task-Oriented Dialogue Systems

Datasets

WikiSQL

TabFact

WikiTableQuestions MTOP HybridQA

SParC

SQA

KVRET

Results from the Paper

Edit

Ranked #1 on Task-Oriented Dialogue Systems on KVRET

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Task-Oriented Dialogue Systems	KVRET	T5-3b(UnifiedSKG)	Entity F1	70.07	# 1	Compare
Table-based Fact Verification	TabFact	T5-3b(UnifiedSKG)	Test	83.68	# 7	Compare
Table-based Fact Verification	TabFact	T5-3b(UnifiedSKG)	Val	83.97	# 4	Compare
Semantic Parsing	WikiTableQuestions	T5-3b(UnifiedSKG)	Accuracy (Dev)	50.65	# 8	Compare
Semantic Parsing	WikiTableQuestions	T5-3b(UnifiedSKG)	Accuracy (Test)	49.29	# 11	Compare

Methods

Add Remove

Adafactor • Adam • Attention Dropout • BPE • Cosine Annealing • Dense Connections • Dropout • Fixed Factorized Attention • GELU • GLU • GPT-3 • Inverse Square Root Schedule • Layer Normalization • Linear Layer • Linear Warmup With Cosine Annealing • Multi-Head Attention • Residual Connection • Scaled Dot-Product Attention • SentencePiece • Softmax • Strided Attention • T5 • Weight Decay

Edit Social Preview

UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove