TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Hate Speech Detection	HateXplain	BERT-MRP	AUROC	0.862	# 1
Hate Speech Detection	HateXplain	BERT-MRP	Accuracy	0.704	# 2
Hate Speech Detection	HateXplain	BERT-MRP	Macro F1	0.699	# 1
Hate Speech Detection	HateXplain	BERT-RP	AUROC	0.853	# 2
Hate Speech Detection	HateXplain	BERT-RP	Accuracy	0.707	# 1
Hate Speech Detection	HateXplain	BERT-RP	Macro F1	0.693	# 2

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/why-is-it-hate-speech-masked-rationale-1/hate-speech-detection-on-hatexplain)](https://paperswithcode.com/sota/hate-speech-detection-on-hatexplain?p=why-is-it-hate-speech-masked-rationale-1)`

Why Is It Hate Speech? Masked Rationale Prediction for Explainable Hate Speech Detection

COLING 2022 · Jiyun Kim, Byounghan Lee, Kyung-Ah Sohn ·

In a hate speech detection model, we should consider two critical aspects in addition to detection performance-bias and explainability. Hate speech cannot be identified based solely on the presence of specific words: the model should be able to reason like humans and be explainable. To improve the performance concerning the two aspects, we propose Masked Rationale Prediction (MRP) as an intermediate task. MRP is a task to predict the masked human rationales-snippets of a sentence that are grounds for human judgment-by referring to surrounding tokens combined with their unmasked rationales. As the model learns its reasoning ability based on rationales by MRP, it performs hate speech detection robustly in terms of bias and explainability. The proposed method generally achieves state-of-the-art performance in various metrics, demonstrating its effectiveness for hate speech detection.

PDF Abstract COLING 2022 PDF COLING 2022 Abstract

Code

Add Remove Mark official

alatteaday/mrp_hate-speech-detection official

Tasks

Add Remove

Hate Speech Detection

Sentence

Datasets

HateXplain

Results from the Paper

Edit

Ranked #1 on Hate Speech Detection on HateXplain

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Hate Speech Detection	HateXplain	BERT-MRP	AUROC	0.862	# 1	Compare
			Accuracy	0.704	# 2	Compare
			Macro F1	0.699	# 1	Compare
Hate Speech Detection	HateXplain	BERT-RP	AUROC	0.853	# 2	Compare
			Accuracy	0.707	# 1	Compare
			Macro F1	0.693	# 2	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Why Is It Hate Speech? Masked Rationale Prediction for Explainable Hate Speech Detection

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove