TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Grammatical Error Correction	BEA-2019 (test)	Majority-voting ensemble on best 7 models	F0.5	81.4	# 1
Grammatical Error Correction	CoNLL-2014 Shared Task	Ensembles of best 7 models + GRECO + GTP-rerank	F0.5	72.8	# 1
Grammatical Error Correction	CoNLL-2014 Shared Task	Ensembles of best 7 models + GRECO + GTP-rerank	Precision	83.9	# 1
Grammatical Error Correction	CoNLL-2014 Shared Task	Ensembles of best 7 models + GRECO + GTP-rerank	Recall	47.5	# 3
Grammatical Error Correction	CoNLL-2014 Shared Task	Majority-voting ensemble on best 7 models	F0.5	71.8	# 2
Grammatical Error Correction	CoNLL-2014 Shared Task	Majority-voting ensemble on best 7 models	Precision	83.7	# 2
Grammatical Error Correction	CoNLL-2014 Shared Task	Majority-voting ensemble on best 7 models	Recall	45.7	# 5

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/pillars-of-grammatical-error-correction/grammatical-error-correction-on-bea-2019-test)](https://paperswithcode.com/sota/grammatical-error-correction-on-bea-2019-test?p=pillars-of-grammatical-error-correction)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/pillars-of-grammatical-error-correction/grammatical-error-correction-on-conll-2014)](https://paperswithcode.com/sota/grammatical-error-correction-on-conll-2014?p=pillars-of-grammatical-error-correction)`

Pillars of Grammatical Error Correction: Comprehensive Inspection Of Contemporary Approaches In The Era of Large Language Models

23 Apr 2024 · Kostiantyn Omelianchuk, Andrii Liubonko, Oleksandr Skurzhanskyi, Artem Chernodub, Oleksandr Korniienko, Igor Samokhin ·

In this paper, we carry out experimental research on Grammatical Error Correction, delving into the nuances of single-model systems, comparing the efficiency of ensembling and ranking methods, and exploring the application of large language models to GEC as single-model systems, as parts of ensembles, and as ranking methods. We set new state-of-the-art performance with F_0.5 scores of 72.8 on CoNLL-2014-test and 81.4 on BEA-test, respectively. To support further advancements in GEC and ensure the reproducibility of our research, we make our code, trained models, and systems' outputs publicly available.

PDF Abstract

Code

Add Remove Mark official

grammarly/pillars-of-gec official

Tasks

Add Remove

Grammatical Error Correction

Datasets

CoNLL FCE

CoNLL-2014 Shared Task: Grammatical Error Correction

WI-LOCNESS

Results from the Paper

Edit

Ranked #1 on Grammatical Error Correction on CoNLL-2014 Shared Task

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Grammatical Error Correction	BEA-2019 (test)	Majority-voting ensemble on best 7 models	F0.5	81.4	# 1	Compare
Grammatical Error Correction	CoNLL-2014 Shared Task	Ensembles of best 7 models + GRECO + GTP-rerank	F0.5	72.8	# 1	Compare
			Precision	83.9	# 1	Compare
			Recall	47.5	# 3	Compare
Grammatical Error Correction	CoNLL-2014 Shared Task	Majority-voting ensemble on best 7 models	F0.5	71.8	# 2	Compare
			Precision	83.7	# 2	Compare
			Recall	45.7	# 5	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Pillars of Grammatical Error Correction: Comprehensive Inspection Of Contemporary Approaches In The Era of Large Language Models

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove