K-MHaS: Korean Multi-label Hate Speech Dataset

Introduced by Lee et al. in K-MHaS: A Multi-label Hate Speech Detection Dataset in Korean Online News Comment

We introduce K-MHaS, a new multi-label dataset for hate speech detection that effectively handles Korean language patterns.

The dataset consists of 109,692 utterances from Korean online news comments, labeled with 8 fine-grained hate speech classes.
The data was collected between January 2018 and June 2020.
It provides (a) binary classification and (b) multi-label classification with one to four labels per utterance.
(a) Binary classification: Hate Speech or Not Hate Speech.
(b) Fine-grained classification: Politics, Origin, Physical, Age, Gender, Religion, Race, and Profanity.

For the fine-grained classification, the Hate Speech class from the binary classification is broken down into eight classes, one for each hate speech category.
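
The relationship between the binary and fine-grained labels can be illustrated with a short Python sketch. It is not part of the dataset's tooling: the helper names `to_multi_hot` and `to_binary` are hypothetical, the class order simply follows the list above and may not match the index order used in the released files, and an empty label list is assumed to stand for Not Hate Speech.

```python
from typing import List

# Class order copied from the description above; the released dataset
# may use a different index order (assumption).
FINE_GRAINED_CLASSES = [
    "Politics", "Origin", "Physical", "Age",
    "Gender", "Religion", "Race", "Profanity",
]

MAX_LABELS = 4  # the description states one to four labels per utterance


def to_multi_hot(labels: List[str]) -> List[int]:
    """Encode fine-grained class names as an 8-dimensional multi-hot vector."""
    unique = set(labels)
    unknown = unique - set(FINE_GRAINED_CLASSES)
    if unknown:
        raise ValueError(f"unknown classes: {sorted(unknown)}")
    if len(unique) > MAX_LABELS:
        raise ValueError(f"at most {MAX_LABELS} labels are expected")
    return [1 if name in unique else 0 for name in FINE_GRAINED_CLASSES]


def to_binary(labels: List[str]) -> int:
    """Collapse fine-grained labels to the binary task.

    Returns 1 (Hate Speech) if any fine-grained class is present,
    otherwise 0 (Not Hate Speech). Treating an empty list as
    Not Hate Speech is an assumption made for this sketch.
    """
    return 1 if labels else 0
```

For example, a comment annotated with Gender and Profanity would map to a vector with ones in those two positions and to the Hate Speech class in the binary task, while a comment with no fine-grained label would map to an all-zero vector.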

Dataset Loaders

huggingface/datasets (korean_hate_speech_copy)

huggingface/datasets (kmhas_korean_hate_speech)

adlnlp/K-MHaS
