TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Automated Essay Scoring	ASAP	Considering-Content-XLNet	Quadratic Weighted Kappa	0.786	#2


Countering the Influence of Essay Length in Neural Essay Scoring

EMNLP (SustaiNLP) 2021 · Sungho Jeon, Michael Strube

Previous work has shown that automated essay scoring systems, in particular machine learning-based systems, cannot assess the quality of essays and instead rely on essay length, a factor irrelevant to writing proficiency. In this work, we first show that state-of-the-art systems, namely recent neural essay scoring systems, might also be influenced by the correlation between essay length and scores in a standard dataset. In our evaluation, a very simple neural model achieves state-of-the-art performance on the standard dataset. To consider essay content without taking essay length into account, we introduce a simple neural model that assesses the similarity of content between an input essay and essays assigned different scores. This neural model achieves performance comparable to the state of the art on a standard dataset as well as on a second dataset. Our findings suggest that neural essay scoring systems should consider the characteristics of datasets to focus on text quality.
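The leaderboard entry for this paper reports Quadratic Weighted Kappa (QWK), an agreement measure between predicted and human-assigned integer scores that penalizes each disagreement by the squared distance between the two ratings. The following is a minimal sketch of the standard QWK formula in plain Python; it is not the authors' evaluation code. The function name and the `min_rating`/`max_rating` parameters are illustrative assumptions, and how the reported 0.786 was aggregated across essay prompts is not stated here.

```python
def quadratic_weighted_kappa(y_true, y_pred, min_rating, max_rating):
    """Standard QWK between two lists of integer ratings (illustrative helper)."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and of equal length")
    n = max_rating - min_rating + 1  # number of distinct rating levels
    if n < 2:
        raise ValueError("need at least two rating levels")

    # Observed confusion matrix: rows are true ratings, columns are predictions.
    observed = [[0] * n for _ in range(n)]
    for t, p in zip(y_true, y_pred):
        observed[t - min_rating][p - min_rating] += 1

    total = len(y_true)
    hist_true = [sum(row) for row in observed]
    hist_pred = [sum(observed[i][j] for i in range(n)) for j in range(n)]

    numerator = 0.0    # weighted observed disagreement
    denominator = 0.0  # weighted disagreement expected by chance
    for i in range(n):
        for j in range(n):
            weight = (i - j) ** 2 / (n - 1) ** 2
            expected = hist_true[i] * hist_pred[j] / total
            numerator += weight * observed[i][j]
            denominator += weight * expected

    if denominator == 0:
        # Both raters gave the same single rating to every essay.
        return 1.0
    return 1.0 - numerator / denominator


# Identical ratings give perfect agreement; fully reversed ratings give -1.
print(quadratic_weighted_kappa([1, 2, 3, 4], [1, 2, 3, 4], 1, 4))
print(quadratic_weighted_kappa([1, 2, 3], [3, 2, 1], 1, 3))
```

A value of 1 means perfect agreement, 0 means agreement no better than chance, and negative values mean systematic disagreement. Because the weights grow quadratically, a prediction that is off by two score points costs four times as much as one that is off by one.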
