Neural Quality Estimation with Multiple Hypotheses for Grammatical Error Correction

Grammatical Error Correction (GEC) aims to correct writing errors and help language learners improve their writing skills. However, existing GEC models tend to produce spurious corrections or fail to detect lots of errors. The quality estimation model is necessary to ensure learners get accurate GEC results and avoid misleading from poorly corrected sentences. Well-trained GEC models can generate several high-quality hypotheses through decoding, such as beam search, which provide valuable GEC evidence and can be used to evaluate GEC quality. However, existing models neglect the possible GEC evidence from different hypotheses. This paper presents the Neural Verification Network (VERNet) for GEC quality estimation with multiple hypotheses. VERNet establishes interactions among hypotheses with a reasoning graph and conducts two kinds of attention mechanisms to propagate GEC evidence to verify the quality of generated hypotheses. Our experiments on four GEC datasets show that VERNet achieves state-of-the-art grammatical error detection performance, achieves the best quality estimation results, and significantly improves GEC performance by reranking hypotheses. All data and source codes are available at

PDF Abstract NAACL 2021 PDF NAACL 2021 Abstract

Results from the Paper

Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Grammatical Error Correction BEA-2019 (test) VERNet F0.5 68.9 # 9
Grammatical Error Detection CoNLL-2014 A1 VERNet F0.5 54.3 # 1
Grammatical Error Detection CoNLL-2014 A2 VERNet F0.5 63.1 # 1
Grammatical Error Correction CoNLL-2014 Shared Task VERNet F0.5 63.7 # 7
Grammatical Error Detection FCE VERNet F0.5 72.2 # 1
Grammatical Error Correction JFLEG VERNet GLEU 62.1 # 1


No methods listed for this paper. Add relevant methods here