TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Grammatical Error Correction	BEA-2019 (test)	LM-Critic	F0.5	72.9	# 9
Grammatical Error Correction	CoNLL-2014 Shared Task	LM-Critic	F0.5	65.8	# 7
Grammatical Error Correction	Restricted	+ BIFI with no critic	F0.5	18.7	# 4
Grammatical Error Correction	Unrestricted	+ BIFI (ours)	F0.5	42.4	# 2

LM-Critic: Language Models for Unsupervised Grammatical Error Correction

EMNLP 2021 · Michihiro Yasunaga, Jure Leskovec, Percy Liang

Training a model for grammatical error correction (GEC) requires a set of labeled ungrammatical / grammatical sentence pairs, but manually annotating such pairs can be expensive. Recently, the Break-It-Fix-It (BIFI) framework has demonstrated strong results on learning to repair a broken program without any labeled examples, but this relies on a perfect critic (e.g., a compiler) that returns whether an example is valid or not, which does not exist for the GEC task. In this work, we show how to leverage a pretrained language model (LM) in defining an LM-Critic, which judges a sentence to be grammatical if the LM assigns it a higher probability than its local perturbations. We apply this LM-Critic and BIFI along with a large set of unlabeled sentences to bootstrap realistic ungrammatical / grammatical pairs for training a corrector. We evaluate our approach on GEC datasets across multiple domains (CoNLL-2014, BEA-2019, GMEG-wiki and GMEG-yahoo) and show that it outperforms existing methods in both the unsupervised setting (+7.7 F0.5) and the supervised setting (+0.5 F0.5).
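The critic criterion in the abstract (a sentence counts as grammatical if the LM assigns it a higher probability than its local perturbations) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: `toy_log_prob`, `toy_perturbations`, `CONFUSION_SETS` and `BAD_BIGRAMS` are hypothetical stand-ins for a pretrained LM scorer and a real perturbation function, chosen only so the example runs without external dependencies.

```python
from typing import Callable, Iterable

# Hypothetical confusion sets used to generate local perturbations.
CONFUSION_SETS = [{"is", "are"}, {"a", "an"}]

# Hypothetical "bad" bigrams that the toy scorer penalizes.
BAD_BIGRAMS = {("they", "is"), ("he", "are"), ("a", "apple"), ("an", "dog")}


def toy_perturbations(sentence: str) -> Iterable[str]:
    """Yield sentences that differ from `sentence` by one word swap."""
    words = sentence.split()
    for i, word in enumerate(words):
        for group in CONFUSION_SETS:
            if word in group:
                for alt in sorted(group - {word}):
                    yield " ".join(words[:i] + [alt] + words[i + 1:])


def toy_log_prob(sentence: str) -> float:
    """Stand-in for an LM log-probability: longer and 'bad' text scores lower."""
    words = sentence.lower().split()
    penalty = sum((a, b) in BAD_BIGRAMS for a, b in zip(words, words[1:]))
    return -1.0 * len(words) - 5.0 * penalty


def lm_critic(
    sentence: str,
    log_prob: Callable[[str], float],
    perturb: Callable[[str], Iterable[str]],
) -> bool:
    """Judge `sentence` grammatical if it outscores every local perturbation.

    A sentence with no perturbations is trivially accepted in this sketch.
    """
    base = log_prob(sentence)
    return all(base > log_prob(p) for p in perturb(sentence))


if __name__ == "__main__":
    for s in ["they are happy", "they is happy"]:
        print(s, "->", lm_critic(s, toy_log_prob, toy_perturbations))
```

In practice the scorer would presumably be a pretrained LM and the perturbation set much richer; the sketch only shows the comparison against local neighbours that the abstract describes.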

Code

michiyasunaga/LM-Critic (official), 117 stars

grammarly/gector, 862 stars

Tasks

Grammatical Error Correction

Language Modelling

Sentence

valid

Datasets

CoNLL

Yahoo! Answers

JFLEG

CoNLL-2014 Shared Task: Grammatical Error Correction

WI-LOCNESS

GMEG-wiki

GMEG-yahoo

Results from the Paper

Ranked #2 on Grammatical Error Correction on Unrestricted
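The scores reported for this paper use the F0.5 metric, which weights precision more heavily than recall and appears here as a percentage. As a rough illustration of how such a score is derived from edit counts, the sketch below computes F-beta with beta = 0.5. The function name `f_beta` and the counts in the usage example are hypothetical and are not taken from the paper.

```python
def f_beta(tp: int, fp: int, fn: int, beta: float = 0.5) -> float:
    """Compute F-beta from true-positive, false-positive and false-negative counts.

    With beta = 0.5, precision counts more than recall.
    Returns 0.0 when precision and recall are both zero.
    """
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision == 0.0 and recall == 0.0:
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


if __name__ == "__main__":
    # Hypothetical counts: 6 correct edits, 2 spurious edits, 6 missed edits.
    # Precision = 0.75, recall = 0.5.
    print(round(f_beta(6, 2, 6), 3))  # 0.682
```

Multiplying the result by 100 gives a value on the same scale as the F0.5 figures listed for the paper.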

Methods

Repair
