Bias Detection

54 papers with code • 5 benchmarks • 8 datasets

Bias detection is the task of detecting and measuring racist, sexist, and otherwise discriminatory behavior in a model (Source: https://stereoset.mit.edu/).

Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations

innodatalabs/innodata-llm-safety 15 Apr 2024

In this research, we used OpenAI GPT as point of comparison since it excels at all levels of safety.

2

OpenBias: Open-set Bias Detection in Text-to-Image Generative Models

picsart-ai-research/openbias 11 Apr 2024

In this paper, we tackle the challenge of open-set bias detection in text-to-image generative models presenting OpenBias, a new pipeline that identifies and quantifies the severity of biases agnostically, without access to any precompiled set.

4

RuBia: A Russian Language Bias Detection Dataset

vergrig/RuBia-Dataset 26 Mar 2024

To illustrate the dataset's purpose, we conduct a diagnostic evaluation of state-of-the-art or near-state-of-the-art LLMs and discuss the LLMs' predisposition to social biases.

0

The Media Bias Taxonomy: A Systematic Literature Review on the Forms and Automated Detection of Media Bias

media-bias-group/media-bias-taxonomy 26 Dec 2023

However, we have identified a lack of interdisciplinarity in existing projects, and a need for more awareness of the various types of media bias to support methodologically thorough performance evaluations of media bias detection systems.

1

LUCID-GAN: Conditional Generative Models to Locate Unfairness

integrated-intelligence-lab/canonical_sets 28 Jul 2023

Most group fairness notions detect unethical biases by computing statistical parity metrics on a model's output.

6
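The LUCID-GAN snippet above mentions statistical parity metrics computed on a model's output. As a rough illustration of that general idea (not the paper's method, and not code from the linked repository), the sketch below computes a statistical parity difference: the gap in positive-prediction rates between two groups. The function names, group labels, and toy data are all hypothetical and chosen only for this example.

```python
from collections import defaultdict


def positive_rates(predictions, groups):
    """Return the fraction of positive (1) predictions for each group."""
    totals = defaultdict(int)
    positives = defaultdict(int)
    for pred, group in zip(predictions, groups):
        totals[group] += 1
        positives[group] += int(pred == 1)
    return {g: positives[g] / totals[g] for g in totals}


def statistical_parity_difference(predictions, groups, group_a, group_b):
    """Positive rate of group_a minus positive rate of group_b.

    A value of 0 means both groups receive positive predictions at the
    same rate; larger absolute values indicate a larger disparity.
    """
    rates = positive_rates(predictions, groups)
    return rates[group_a] - rates[group_b]


# Toy data, made up for illustration: binary model outputs and the
# (hypothetical) protected-group label of each example.
predictions = [1, 0, 1, 1, 0, 0, 1, 0]
groups = ["a", "a", "a", "a", "b", "b", "b", "b"]

spd = statistical_parity_difference(predictions, groups, "a", "b")
print(f"statistical parity difference (a - b): {spd:.2f}")
```

On this toy data, group "a" receives positive predictions at a rate of 0.75 and group "b" at 0.25, so the difference is 0.50. A metric like this looks only at the model's outputs, which is the property the snippet attributes to most group fairness notions.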

The Hidden Language of Diffusion Models

hila-chefer/Conceptor 1 Jun 2023

In this work, we present Conceptor, a novel method to interpret the internal representation of a textual concept by a diffusion model.

63

A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark Datasets

ntunlp/chatgpt_eval 29 May 2023

The development of large language models (LLMs) such as ChatGPT has brought a lot of attention recently.

14

Trade-Offs Between Fairness and Privacy in Language Modeling

cleolotta/fair-and-private-lm 24 May 2023

Protecting privacy in contemporary NLP models is gaining in importance.

1

BiasAsker: Measuring the Bias in Conversational AI System

yxwan123/biasasker 21 May 2023

Particularly, it is hard to generate inputs that can comprehensively trigger potential bias due to the lack of data containing both social groups as well as biased properties.

13