ErAConD (Error Annotated Conversational Dialog Dataset for Grammatical Error Correction)

Introduced by Yuan et al. in ErAConD: Error Annotated Conversational Dialog Dataset for Grammatical Error Correction

ErAConD is a novel GEC dataset consisting of parallel original and corrected utterances drawn from open-domain chatbot conversations.

We collected 186 dialogs containing 1735 user utterance turns of open-domain dialog data by deploying BlenderBot on Amazon Mechanical Turk (AMT) via LEGOEval.

This dataset is, to our knowledge, the first GEC dataset targeted to a human-machine conversational setting.

License


  • MIT License
