Hate Speech Detection

164 papers with code • 14 benchmarks • 39 datasets

Hate speech detection is the task of detecting if communication such as text, audio, and so on contains hatred and or encourages violence towards a person or a group of people. This is usually based on prejudice against 'protected characteristics' such as their ethnicity, gender, sexual orientation, religion, age et al. Some example benchmarks are ETHOS and HateXplain. Models can be evaluated with metrics like the F-score or F-measure.

Benchmarks

Add a Result

These leaderboards are used to track progress in Hate Speech Detection

Dataset	Best Model	Compare
Ethos Binary	BiLSTM + static BE	See all
HateXplain	BERT-MRP	See all
Ethos MultiLabel	MLARAM	See all
Waseem et al., 2018	Mozafari et al., 2019	See all
Automatic Misogynistic Identification	mBert	See all
ToLD-Br	Multilingual BERT	See all
OffensEval 2019	HateBERT	See all
AbusEval	HateBERT	See all
HatEval	HateBERT	See all
Hostility Detection Dataset in Hindi	Auxiliary IndicBert	See all
bajer_danish_misogyny	AOM mBERT	See all
DKhate	Baseline	See all
SHAJ	Baseline BERT (task A)	See all
OLID	RoBERTa-large-ST	See all

Show all 14 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Hate Speech Detection models and implementations

l3cube-pune/MarathiNLP

5 papers

Datasets

Subtasks

Latest papers

Most implemented Social Latest No code

Exploring the Trade-off Between Model Performance and Explanation Plausibility of Text Classifiers Using Human Rationales

visual-ds/plausible-nlp-explanations • 3 Apr 2024

By leveraging a multi-objective optimization algorithm, we explore the trade-off between the two loss functions and generate a Pareto-optimal frontier of models that balance performance and plausibility.

03 Apr 2024

Paper
Code

NaijaHate: Evaluating Hate Speech Detection on Nigerian Twitter Using Representative Data

manueltonneau/naijahate • 28 Mar 2024

To address the global issue of hateful content proliferating in online platforms, hate speech detection (HSD) models are typically developed on datasets collected in the United States, thereby failing to generalize to English dialects from the Majority World.

28 Mar 2024

Paper
Code

Improving Adversarial Data Collection by Supporting Annotators: Lessons from GAHD, a German Hate Speech Dataset

jagol/gahd • 28 Mar 2024

Our experiments show that the resulting dataset is challenging even for state-of-the-art hate speech detection models, and that training on GAHD clearly improves model robustness.

28 Mar 2024

Paper
Code

CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean

rladmstn1714/click • 11 Mar 2024

Despite the rapid development of large language models (LLMs) for the Korean language, there remains an obvious lack of benchmark datasets that test the requisite Korean cultural and linguistic knowledge.

11 Mar 2024

Paper
Code

GPT-HateCheck: Can LLMs Write Better Functional Tests for Hate Speech Detection?

yipingnus/gpt-hate-check • 23 Feb 2024

A recent proposal in this direction is HateCheck, a suite for testing fine-grained model functionalities on synthesized data generated using templates of the kind "You are just a [slur] to me."

23 Feb 2024

Paper
Code

Bryndza at ClimateActivism 2024: Stance, Target and Hate Event Detection via Retrieval-Augmented GPT-4 and LLaMA

naiveneuron/bryndza-case-2024 • 9 Feb 2024

This study details our approach for the CASE 2024 Shared Task on Climate Activism Stance and Hate Event Detection, focusing on Hate Speech Detection, Hate Speech Target Identification, and Stance Detection as classification challenges.

09 Feb 2024

Paper
Code

Probing Critical Learning Dynamics of PLMs for Hate Speech Detection

lcs2-iiitd/hatefinetune • • 3 Feb 2024

Despite the widespread adoption, there is a lack of research into how various critical aspects of pretrained language models (PLMs) affect their performance in hate speech detection.

03 Feb 2024

Paper
Code

Attentive Fusion: A Transformer-based Approach to Multimodal Hate Speech Detection

atanumandal0491/Multimodality-Hate-Speech-Identification • • 19 Jan 2024

With the recent surge and exponential growth of social media usage, scrutinizing social media content for the presence of any hateful content is of utmost importance.

19 Jan 2024

Paper
Code

MetaHate: A Dataset for Unifying Efforts on Hate Speech Detection

palomapiot/metahate • 12 Jan 2024

Hate speech represents a pervasive and detrimental form of online discourse, often manifested through an array of slurs, from hateful tweets to defamatory posts.

12 Jan 2024

Paper
Code

TuPy-E: detecting hate speech in Brazilian Portuguese social media with a novel dataset and comprehensive analysis of models

Silly-Machine/TuPyE-Dataset • 29 Dec 2023

Social media has become integral to human interaction, providing a platform for communication and expression.

29 Dec 2023

Paper
Code

Hate Speech Detection

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers

Content

Benchmarks

Add a Result