Abuse Detection

30 papers with code • 0 benchmarks • 4 datasets

Abuse detection is the task of identifying abusive behaviors, such as hate speech, offensive language, sexism and racism, in utterances from social media platforms (Source: https://arxiv.org/abs/1802.00385).

Benchmarks

Add a Result

These leaderboards are used to track progress in Abuse Detection

You can find evaluation results in the subtasks. You can also submitting evaluation metrics for this task.

Datasets

Subtasks

Hate Speech Detection

Latest papers

Most implemented Social Latest No code

Breaking the Silence Detecting and Mitigating Gendered Abuse in Hindi, Tamil, and Indian English Online Spaces

advaithavetagiri/cnlp-nits-pp • 2 Apr 2024

Online gender-based harassment is a widespread issue limiting the free expression and participation of women and marginalized genders in digital spaces.

02 Apr 2024

Paper
Code

TCAB: A Large-Scale Text Classification Attack Benchmark

react-nlp/tcab_generation • • 21 Oct 2022

In addition to the primary tasks of detecting and labeling attacks, TCAB can also be used for attack localization, attack target labeling, and attack characterization.

21 Oct 2022

Paper
Code

Explainable Abuse Detection as Intent Classification and Slot Filling

ago3/plead • • 6 Oct 2022

To proactively offer social media users a safe online experience, there is a need for systems that can detect harmful posts and promptly alert platform moderators.

06 Oct 2022

Paper
Code

Improving Generalizability in Implicitly Abusive Language Detection with Concept Activation Vectors

isarnejad/tcav-for-text-classifiers • • ACL 2022

Robustness of machine learning models on ever-changing real-world data is critical, especially for applications affecting human well-being such as content moderation.

05 Apr 2022

Paper
Code

Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists

g8a9/ear • • Findings (ACL) 2022

EAR also reveals overfitting terms, i. e., terms most likely to induce bias, to help identify their effect on the model, task, and predictions.

17 Mar 2022

Paper
Code

ADIMA: Abuse Detection In Multilingual Audio

sharechatai/adima • • 16 Feb 2022

Abusive content detection in spoken text can be addressed by performing Automatic Speech Recognition (ASR) and leveraging advancements in natural language processing.

16 Feb 2022

Paper
Code

ConvAbuse: Data, Analysis, and Benchmarks for Nuanced Abuse Detection in Conversational AI

amandacurry/convabuse • 20 Sep 2021

We find that the distribution of abuse is vastly different compared to other commonly used datasets, with more sexually tinted aggression towards the virtual persona of these systems.

20 Sep 2021

Paper
Code

AAA: Fair Evaluation for Abuse Detection Systems Wanted

Ago3/Adversifier • ACM Web Science 2021

In this work, we introduce Adversarial Attacks against Abuse (AAA), a new evaluation strategy and associated metric that better captures a model’s performance on certain classes of hard-to-classify microposts, and for example penalises systems which are biased on low-level lexical features.

21 Jun 2021

Paper
Code

AbuseAnalyzer: Abuse Detection, Severity and Target Prediction for Gab Posts

mohit3011/AbuseAnalyzer • • COLING 2020

While extensive popularity of online social media platforms has made information dissemination faster, it has also resulted in widespread online abuse of different types like hate speech, offensive language, sexist and racist opinions, etc.

30 Sep 2020

Paper
Code

KUISAIL at SemEval-2020 Task 12: BERT-CNN for Offensive Speech Identification in Social Media

alisafaya/OffensEval2020 • • SEMEVAL 2020

In this paper, we describe our approach to utilize pre-trained BERT models with Convolutional Neural Networks for sub-task A of the Multilingual Offensive Language Identification shared task (OffensEval 2020), which is a part of the SemEval 2020.

26 Jul 2020

Paper
Code

Abuse Detection

Benchmarks Add a Result

Datasets

Subtasks

Latest papers

Content

Benchmarks

Add a Result