TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Hate Speech Detection	AbusEval	HateBERT	Macro F1	0.742	# 1
Hate Speech Detection	AbusEval	BERT	Macro F1	0.724	# 2
Hate Speech Detection	HatEval	HateBERT	Macro F1	0.494	# 1
Hate Speech Detection	HatEval	BERT	Macro F1	0.48	# 2
Hate Speech Detection	OffensEval 2019	HateBERT	Macro F1	0.805	# 1
Hate Speech Detection	OffensEval 2019	BERT	Macro F1	0.803	# 2

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/hatebert-retraining-bert-for-abusive-language/hate-speech-detection-on-abuseval)](https://paperswithcode.com/sota/hate-speech-detection-on-abuseval?p=hatebert-retraining-bert-for-abusive-language)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/hatebert-retraining-bert-for-abusive-language/hate-speech-detection-on-hateval)](https://paperswithcode.com/sota/hate-speech-detection-on-hateval?p=hatebert-retraining-bert-for-abusive-language)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/hatebert-retraining-bert-for-abusive-language/hate-speech-detection-on-offenseval-2019)](https://paperswithcode.com/sota/hate-speech-detection-on-offenseval-2019?p=hatebert-retraining-bert-for-abusive-language)`

HateBERT: Retraining BERT for Abusive Language Detection in English

ACL (WOAH) 2021 · Tommaso Caselli, Valerio Basile, Jelena Mitrović, Michael Granitzer ·

In this paper, we introduce HateBERT, a re-trained BERT model for abusive language detection in English. The model was trained on RAL-E, a large-scale dataset of Reddit comments in English from communities banned for being offensive, abusive, or hateful that we have collected and made available to the public. We present the results of a detailed comparison between a general pre-trained language model and the abuse-inclined version obtained by retraining with posts from the banned communities on three English datasets for offensive, abusive language and hate speech detection tasks. In all datasets, HateBERT outperforms the corresponding general BERT model. We also discuss a battery of experiments comparing the portability of the generic pre-trained language model and its corresponding abusive language-inclined counterpart across the datasets, indicating that portability is affected by compatibility of the annotated phenomena.

PDF Abstract ACL (WOAH) 2021 PDF ACL (WOAH) 2021 Abstract

Code

Add Remove Mark official

tommasoc80/HateBERT official

Tasks

Add Remove

Abusive Language

Hate Speech Detection

Language Modelling

Datasets

HatEval

Results from the Paper

Edit

Ranked #1 on Hate Speech Detection on AbusEval

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Hate Speech Detection	AbusEval	HateBERT	Macro F1	0.742	# 1	Compare
Hate Speech Detection	AbusEval	BERT	Macro F1	0.724	# 2	Compare
Hate Speech Detection	HatEval	HateBERT	Macro F1	0.494	# 1	Compare
Hate Speech Detection	HatEval	BERT	Macro F1	0.48	# 2	Compare
Hate Speech Detection	OffensEval 2019	HateBERT	Macro F1	0.805	# 1	Compare
Hate Speech Detection	OffensEval 2019	BERT	Macro F1	0.803	# 2	Compare

Methods

Add Remove

Adam • Attention Dropout • BERT • Dense Connections • Dropout • GELU • Layer Normalization • Linear Layer • Linear Warmup With Linear Decay • Multi-Head Attention • Residual Connection • Scaled Dot-Product Attention • Softmax • Weight Decay • WordPiece

Edit Social Preview

HateBERT: Retraining BERT for Abusive Language Detection in English

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove