TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Table-based Fact Verification	TabFact	Binder	Test	86.0	# 4
Table-based Fact Verification	TabFact	Binder	Val	-	# 9
Semantic Parsing	WikiTableQuestions	Binder	Accuracy (Dev)	65.0	# 1
Semantic Parsing	WikiTableQuestions	Binder	Accuracy (Test)	64.6	# 6

Binding Language Models in Symbolic Languages

6 Oct 2022 · Zhoujun Cheng, Tianbao Xie, Peng Shi, Chengzu Li, Rahul Nadkarni, Yushi Hu, Caiming Xiong, Dragomir Radev, Mari Ostendorf, Luke Zettlemoyer, Noah A. Smith, Tao Yu

Though end-to-end neural approaches have recently been dominating NLP tasks in both performance and ease-of-use, they lack interpretability and robustness. We propose Binder, a training-free neural-symbolic framework that maps the task input to a program, which (1) allows binding a unified API of language model (LM) functionalities to a programming language (e.g., SQL, Python) to extend its grammar coverage and thus tackle more diverse questions, (2) adopts an LM as both the program parser and the underlying model called by the API during execution, and (3) requires only a few in-context exemplar annotations. Specifically, we employ GPT-3 Codex as the LM. In the parsing stage, with only a few in-context exemplars, Codex is able to identify the part of the task input that cannot be answered by the original programming language, correctly generate API calls that prompt Codex to solve the unanswerable part, and identify where to place the API calls while remaining compatible with the original grammar. In the execution stage, Codex can perform versatile functionalities (e.g., commonsense QA, information extraction) given proper prompts in the API calls. Binder achieves state-of-the-art results on the WikiTableQuestions and TabFact datasets, with explicit output programs that benefit human debugging. Note that the previous best systems are all finetuned on tens of thousands of task-specific samples, while Binder uses only dozens of annotations as in-context exemplars without any training. Our code is available at https://github.com/HKUNLP/Binder.
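The idea of binding an LM call into a symbolic language can be illustrated with a small sketch. The example below is not Binder's implementation or its API: it assumes a SQL function named `QA` (a hypothetical name), replaces the language model with a hard-coded lookup called `fake_lm`, and uses a made-up toy table. It only shows how a query can mix ordinary SQL with a call that answers the part SQL alone cannot, here whether a country is in Europe.

```python
import sqlite3


def fake_lm(question: str, value: str) -> str:
    """Hypothetical stand-in for an LM call.

    A real system would send a prompt to a language model; this stub
    answers from a tiny hard-coded lookup so the example runs offline.
    """
    knowledge = {
        ("is this country in Europe?", "France"): "yes",
        ("is this country in Europe?", "Japan"): "no",
        ("is this country in Europe?", "Italy"): "yes",
    }
    return knowledge.get((question, value), "unknown")


def build_connection() -> sqlite3.Connection:
    """Create an in-memory toy table and register the LM-backed function."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE medals (country TEXT, gold INTEGER)")
    # Toy values invented for illustration only.
    conn.executemany(
        "INSERT INTO medals VALUES (?, ?)",
        [("France", 3), ("Japan", 5), ("Italy", 4)],
    )
    # Expose the Python function to SQL under the (assumed) name QA.
    conn.create_function("QA", 2, fake_lm)
    return conn


def european_gold_total(conn: sqlite3.Connection) -> int:
    """Sum gold medals over rows the stubbed LM labels as European."""
    query = (
        "SELECT SUM(gold) FROM medals "
        "WHERE QA('is this country in Europe?', country) = 'yes'"
    )
    return conn.execute(query).fetchone()[0]


if __name__ == "__main__":
    print(european_gold_total(build_connection()))
```

The table has no column saying which continent a country belongs to, so plain SQL cannot express the filter; the registered function fills that gap while the aggregation stays symbolic and inspectable.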

Code

hkunlp/binder official

Tasks

Language Modelling

Semantic Parsing

Table-based Fact Verification

Datasets

TabFact

WikiTableQuestions

Results from the Paper

Ranked #4 on Table-based Fact Verification on TabFact

Methods

Adam • Attention Dropout • BPE • Cosine Annealing • Dense Connections • Dropout • Fixed Factorized Attention • GELU • GPT-3 • Layer Normalization • Linear Layer • Linear Warmup With Cosine Annealing • Multi-Head Attention • Residual Connection • Scaled Dot-Product Attention • Softmax • Strided Attention • Weight Decay
