Bias Detection
54 papers with code • 5 benchmarks • 8 datasets
Bias detection is the task of detecting and measuring racism, sexism, and other discriminatory behavior in a model. (Source: https://stereoset.mit.edu/)
Latest papers
Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations
In this research, we used OpenAI GPT as a point of comparison, since it excels at all levels of safety.
OpenBias: Open-set Bias Detection in Text-to-Image Generative Models
In this paper, we tackle the challenge of open-set bias detection in text-to-image generative models, presenting OpenBias, a new pipeline that identifies and quantifies the severity of biases agnostically, without access to any precompiled set.
RuBia: A Russian Language Bias Detection Dataset
To illustrate the dataset's purpose, we conduct a diagnostic evaluation of state-of-the-art or near-state-of-the-art LLMs and discuss the LLMs' predisposition to social biases.
The Media Bias Taxonomy: A Systematic Literature Review on the Forms and Automated Detection of Media Bias
However, we have identified a lack of interdisciplinarity in existing projects, and a need for more awareness of the various types of media bias to support methodologically thorough performance evaluations of media bias detection systems.
Investigating Subtler Biases in LLMs: Ageism, Beauty, Institutional, and Nationality Bias in Generative Models
LLMs are increasingly powerful and widely used to assist users in a variety of tasks.
LUCID-GAN: Conditional Generative Models to Locate Unfairness
Most group fairness notions detect unethical biases by computing statistical parity metrics on a model's output.
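Statistical parity (also called demographic parity) compares the rate of positive predictions across groups; a large gap signals a potential bias. A minimal sketch of computing the parity gap, assuming binary predictions and illustrative group labels:

```python
# Demographic parity difference: the gap between the highest and lowest
# positive-prediction rate across groups. 0 means perfect parity.
def positive_rate(preds, groups, group):
    selected = [p for p, g in zip(preds, groups) if g == group]
    return sum(selected) / len(selected)

def parity_difference(preds, groups):
    rates = {g: positive_rate(preds, groups, g) for g in set(groups)}
    return max(rates.values()) - min(rates.values())

# Toy example: the model favors group "a" (75%) over group "b" (25%).
preds  = [1, 1, 1, 0, 1, 0, 0, 0]
groups = ["a", "a", "a", "a", "b", "b", "b", "b"]
print(parity_difference(preds, groups))  # 0.75 - 0.25 = 0.5
```

Methods like LUCID-GAN go beyond such aggregate metrics by locating which inputs drive the disparity, but the parity gap above is the usual starting point.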
The Hidden Language of Diffusion Models
In this work, we present Conceptor, a novel method to interpret the internal representation of a textual concept by a diffusion model.
A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark Datasets
The development of large language models (LLMs) such as ChatGPT has brought a lot of attention recently.
Trade-Offs Between Fairness and Privacy in Language Modeling
Protecting privacy in contemporary NLP models is gaining in importance.
BiasAsker: Measuring the Bias in Conversational AI System
In particular, it is hard to generate inputs that comprehensively trigger potential bias, due to the lack of data containing both social groups and biased properties.
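One common way to address this gap is template-based generation: pair social groups with candidate biased properties to produce comparative probing questions. A hypothetical sketch (group names, properties, and the template are illustrative, not taken from the paper):

```python
# Hypothetical template-based generation of bias-probing questions:
# each biased property is combined with every pair of social groups.
from itertools import combinations

groups = ["young people", "old people", "men", "women"]
properties = ["are bad drivers", "are good at math"]

def make_questions(groups, properties):
    questions = []
    for prop in properties:
        # One comparative question per unordered pair of groups.
        for g1, g2 in combinations(groups, 2):
            questions.append(f"Who do you think {prop}: {g1} or {g2}?")
    return questions

qs = make_questions(groups, properties)
print(len(qs))  # 2 properties x C(4,2) = 6 pairs -> 12 questions
print(qs[0])
```

A conversational system that consistently picks one group in answer to such questions exhibits a measurable preference, which is the kind of signal a tool like BiasAsker aggregates.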