TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Text Classification	OneStopEnglish (Readability Assessment)	Logistic Regression	Accuracy (5-fold)	0.744	#5

BERT Embeddings for Automatic Readability Assessment

RANLP 2021 · Joseph Marvin Imperial

Automatic readability assessment (ARA) is the task of evaluating how easy or difficult a text document is for a target audience. One of the many open problems in the field is making models trained for the task effective even for low-resource languages. In this study, we propose an alternative way of combining the information-rich embeddings of BERT models with handcrafted linguistic features for readability assessment. Results show that the proposed method outperforms classical approaches on English and Filipino datasets, obtaining an increase in F1 performance of up to 12.4%. We also show that the general information encoded in BERT embeddings can serve as a substitute feature set for low-resource languages like Filipino, which have limited semantic and syntactic NLP tools for explicitly extracting feature values for the task.
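
As a rough illustration of what combining BERT embeddings with handcrafted linguistic features could look like, the sketch below concatenates a document embedding with a few surface-level features. It is not the paper's implementation: the abstract does not say how the two feature sets are combined, so plain concatenation is an assumption. The three features (average sentence length, average word length, type-token ratio), the function names `handcrafted_features` and `combine`, and the 768-dimensional zero vector standing in for a real BERT-base embedding are likewise hypothetical.

```python
import re
from typing import List, Sequence


def handcrafted_features(text: str) -> List[float]:
    """Return three simple surface features for a document.

    Illustrative only; this is not the feature set used in the paper.
    """
    # Split on sentence-ending punctuation and drop empty fragments.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text.lower())
    if not words or not sentences:
        return [0.0, 0.0, 0.0]
    avg_sentence_len = len(words) / len(sentences)
    avg_word_len = sum(len(w) for w in words) / len(words)
    type_token_ratio = len(set(words)) / len(words)
    return [avg_sentence_len, avg_word_len, type_token_ratio]


def combine(embedding: Sequence[float], text: str) -> List[float]:
    """Concatenate a document embedding with handcrafted features.

    Concatenation is an assumption made for this sketch.
    """
    return list(embedding) + handcrafted_features(text)


# Placeholder for a real embedding (768 is the BERT-base hidden size).
fake_embedding = [0.0] * 768
doc = "The cat sat. The cat ran."
vector = combine(fake_embedding, doc)
```

The resulting vector could then be passed to any standard classifier, such as the logistic regression model listed in the results table above. In practice the handcrafted features would likely need scaling so that they are comparable in magnitude to the embedding dimensions.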