FCE (First Certificate in English)

Introduced by Helen Yannakoudakis et al. in A New Dataset and Method for Automatically Grading ESOL Texts

The Cambridge Learner Corpus First Certificate in English (CLC FCE) dataset consists of short texts, written by learners of English as an additional language in response to exam prompts eliciting free-text answers and assessing mastery of the upper-intermediate proficiency level. The texts have been manually error-annotated using a taxonomy of 77 error types. The full dataset consists of 323,192 sentences. The publicly released subset of the dataset, named FCE-public, consists of 33,673 sentences split into test and training sets of 2,720 and 30,953 sentences, respectively.

Source: Compositional Sequence Labeling Models for Error Detection in Learner Writing

Homepage

Benchmarks

Add a new result Link an existing benchmark

Trend	Task	Dataset Variant	Best Model	Paper	Code
	Grammatical Error Detection	FCE	VERNet

Papers

Paper	Code	Results	Date	Stars

Dataset Loaders

Add Remove

No data loaders found. You can submit your data loader here.

Tasks

Grammatical Error Detection

Similar Datasets

AKCES-GEC

CoNLL-2014 Shared Task: Grammatical Error Correction

WI-LOCNESS

JFLEG

Usage

FCE (First Certificate in English)

Benchmarks Edit Add a new result Link an existing benchmark

Papers

Dataset Loaders Edit Add Remove

Tasks Edit

Similar Datasets

AKCES-GEC

CoNLL-2014 Shared Task: Grammatical Error Correction

WI-LOCNESS

JFLEG

Usage

License Edit

Modalities Edit

Languages Edit

Benchmarks

Add a new result Link an existing benchmark

Dataset Loaders

Add Remove

Tasks

License

Modalities

Languages