TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Hurtful Sentence Completion	HONEST	BERT-base	HONEST	1.19	# 1
Hurtful Sentence Completion	HONEST	DistilBERT-base	HONEST	1.90	# 2
Hurtful Sentence Completion	HONEST	RoBERTa-large	HONEST	2.62	# 4
Hurtful Sentence Completion	HONEST	RoBERTa-base	HONEST	2.38	# 3
Hurtful Sentence Completion	HONEST	BERT-large	HONEST	3.33	# 5

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/honest-measuring-hurtful-sentence-completion/hurtful-sentence-completion-on-honest-en)](https://paperswithcode.com/sota/hurtful-sentence-completion-on-honest-en?p=honest-measuring-hurtful-sentence-completion)`

HONEST: Measuring Hurtful Sentence Completion in Language Models

NAACL 2021 · Debora Nozza, Federico Bianchi, Dirk Hovy ·

Language models have revolutionized the field of NLP. However, language models capture and proliferate hurtful stereotypes, especially in text generation. Our results show that 4.3{\%} of the time, language models complete a sentence with a hurtful word. These cases are not random, but follow language and gender-specific patterns. We propose a score to measure hurtful sentence completions in language models (HONEST). It uses a systematic template- and lexicon-based bias evaluation methodology for six languages. Our findings suggest that these models replicate and amplify deep-seated societal stereotypes about gender roles. Sentence completions refer to sexual promiscuity when the target is female in 9{\%} of the time, and in 4{\%} to homosexuality when the target is male. The results raise questions about the use of these models in production settings.

PDF Abstract

Code

Add Remove Mark official

milanlproc/honest

↳ Quickstart in

Colab

Tasks

Add Remove

Hate Speech Detection

Hurtful Sentence Completion

Multilingual NLP

Sentence

Sentence Completion

Text Generation

Datasets

Introduced in the Paper:

HONEST

Results from the Paper

Add Remove

Ranked #1 on Hurtful Sentence Completion on HONEST

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Hurtful Sentence Completion	HONEST	BERT-base	HONEST	1.19	# 1	Compare
Hurtful Sentence Completion	HONEST	DistilBERT-base	HONEST	1.90	# 2	Compare
Hurtful Sentence Completion	HONEST	RoBERTa-large	HONEST	2.62	# 4	Compare
Hurtful Sentence Completion	HONEST	RoBERTa-base	HONEST	2.38	# 3	Compare
Hurtful Sentence Completion	HONEST	BERT-large	HONEST	3.33	# 5	Compare

Methods

Add Remove

Adam • Attention Dropout • BERT • DistilBERT • Dropout • GELU • Layer Normalization • Linear Layer • Linear Warmup With Linear Decay • Multi-Head Attention • Residual Connection • RoBERTa • Scaled Dot-Product Attention • Softmax • Weight Decay • WordPiece

Edit Social Preview

HONEST: Measuring Hurtful Sentence Completion in Language Models

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove