TweetBLM: A Hate Speech Dataset and Analysis of Black Lives Matter-related Microblogs on Twitter

25 Jun 2020 · Sumit Kumar, Raj Ratn Pranesh, Subhash Chandra Pandey

In the past few years, there has been a significant rise in toxic and hateful content on various social media platforms. Recently, the Black Lives Matter movement returned to prominence, prompting an avalanche of user-generated responses on the internet. In this paper, we propose TweetBLM, a hate speech dataset of Black Lives Matter-related tweets. Our dataset consists of 9165 manually annotated tweets that target the Black Lives Matter movement. The tweets were annotated into two classes, i.e., "HATE" and "NON-HATE", on the basis of their content related to racism that erupted from the movement. In this work, we also generate useful insights on our dataset and perform a systematic analysis of various state-of-the-art models, such as LSTM, Bi-LSTM, fastText, BERT-base, and BERT-large, for the classification task on our dataset. Through our work, we aim to contribute to the research community's substantial efforts to identify and mitigate hate speech on the internet.
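As a rough illustration of the two-class setup described above, the sketch below scores a set of predicted labels against gold annotations for the "HATE" class. It is not the authors' evaluation code: the function name `binary_prf`, the choice of precision, recall, and F1 as metrics, and the toy label lists are all assumptions made for this example, and none of the values come from the TweetBLM dataset.

```python
from typing import Sequence, Tuple

# The two class names given in the abstract.
HATE = "HATE"
NON_HATE = "NON-HATE"


def binary_prf(
    gold: Sequence[str], pred: Sequence[str], positive: str = HATE
) -> Tuple[float, float, float]:
    """Return (precision, recall, F1) for the `positive` class.

    Hypothetical helper for illustration only; not taken from the paper.
    """
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")

    pairs = list(zip(gold, pred))
    tp = sum(1 for g, p in pairs if g == positive and p == positive)
    fp = sum(1 for g, p in pairs if g != positive and p == positive)
    fn = sum(1 for g, p in pairs if g == positive and p != positive)

    # Guard against division by zero when a class is never predicted or never present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Toy labels (placeholders, not real annotations from the dataset).
gold = [HATE, NON_HATE, HATE, NON_HATE, NON_HATE]
pred = [HATE, HATE, NON_HATE, NON_HATE, NON_HATE]

precision, recall, f1 = binary_prf(gold, pred)
print(f"precision={precision:.2f} recall={recall:.2f} f1={f1:.2f}")
```

Reporting per-class precision and recall alongside accuracy is a common choice for hate speech classification, since the two classes are often imbalanced; whether that applies to this dataset is not stated in the abstract.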

Tasks

Hate Speech Detection

Methods

LSTM • Mish • Sigmoid Activation • Softplus • Tanh Activation
