Search Results for author: Inioluwa Deborah Raji

Found 16 papers, 3 papers with code

Medical Large Language Model Benchmarks Should Prioritize Construct Validity

no code implementations12 Mar 2025 Ahmed Alaa, Thomas Hartvigsen, Niloufar Golchini, Shiladitya Dutta, Frances Dean, Inioluwa Deborah Raji, Travis Zack

In the psychological testing literature, "construct validity" refers to the ability of a test to measure an underlying "construct", that is the actual conceptual target of evaluation.

Clinical Knowledge Language Modeling +2

Toward an Evaluation Science for Generative AI Systems

no code implementations7 Mar 2025 Laura Weidinger, Inioluwa Deborah Raji, Hanna Wallach, Margaret Mitchell, Angelina Wang, Olawale Salaudeen, Rishi Bommasani, Deep Ganguli, Sanmi Koyejo, William Isaac

There is an increasing imperative to anticipate and understand the performance and safety of generative AI systems in real-world deployment contexts.

From Individual Experience to Collective Evidence: A Reporting-Based Framework for Identifying Systemic Harms

1 code implementation12 Feb 2025 Jessica Dai, Paula Gradu, Inioluwa Deborah Raji, Benjamin Recht

When an individual reports a negative interaction with some system, how can their personal experience be contextualized within broader patterns of system behavior?

The Data Addition Dilemma

1 code implementation8 Aug 2024 Judy Hanwen Shen, Inioluwa Deborah Raji, Irene Y. Chen

In many machine learning for healthcare tasks, standard datasets are constructed by amassing data across many, often fundamentally dissimilar, sources.

Decision Making Fairness

Concrete Problems in AI Safety, Revisited

no code implementations18 Dec 2023 Inioluwa Deborah Raji, Roel Dobbe

As AI systems proliferate in society, the AI community is increasingly preoccupied with the concept of AI Safety, namely the prevention of failures due to accidents that arise from an unanticipated departure of a system's behavior from designer intent in AI deployment.

Organizational Governance of Emerging Technologies: AI Adoption in Healthcare

no code implementations25 Apr 2023 Jee Young Kim, William Boag, Freya Gulamali, Alifia Hasan, Henry David Jeffry Hogg, Mark Lifson, Deirdre Mulligan, Manesh Patel, Inioluwa Deborah Raji, Ajai Sehgal, Keo Shaw, Danny Tobey, Alexandra Valladares, David Vidal, Suresh Balu, Mark Sendak

This is one of the most detailed qualitative analyses to date of the current governance structures and processes involved in AI adoption by health systems in the United States.

Ethics

AI and the Everything in the Whole Wide World Benchmark

no code implementations26 Nov 2021 Inioluwa Deborah Raji, Emily M. Bender, Amandalynne Paullada, Emily Denton, Alex Hanna

There is a tendency across different subfields in AI to valorize a small collection of influential benchmarks.

Position

About Face: A Survey of Facial Recognition Evaluation

no code implementations1 Feb 2021 Inioluwa Deborah Raji, Genevieve Fried

We survey over 100 face datasets constructed between 1976 to 2019 of 145 million images of over 17 million subjects from a range of sources, demographics and conditions.

Survey

Saving Face: Investigating the Ethical Concerns of Facial Recognition Auditing

no code implementations3 Jan 2020 Inioluwa Deborah Raji, Timnit Gebru, Margaret Mitchell, Joy Buolamwini, Joonseok Lee, Emily Denton

Although essential to revealing biased performance, well intentioned attempts at algorithmic auditing can have effects that may harm the very populations these measures are meant to protect.

Computers and Society

Closing the AI Accountability Gap: Defining an End-to-End Framework for Internal Algorithmic Auditing

no code implementations3 Jan 2020 Inioluwa Deborah Raji, Andrew Smart, Rebecca N. White, Margaret Mitchell, Timnit Gebru, Ben Hutchinson, Jamila Smith-Loud, Daniel Theron, Parker Barnes

Rising concern for the societal implications of artificial intelligence systems has inspired a wave of academic and journalistic literature in which deployed systems are audited for harm by investigators from outside the organizations deploying the algorithms.

Computers and Society

ABOUT ML: Annotation and Benchmarking on Understanding and Transparency of Machine Learning Lifecycles

no code implementations12 Dec 2019 Inioluwa Deborah Raji, Jingying Yang

We present the "Annotation and Benchmarking on Understanding and Transparency of Machine Learning Lifecycles" (ABOUT ML) project as an initiative to operationalize ML transparency and work towards a standard ML documentation practice.

Benchmarking BIG-bench Machine Learning

On the Legal Compatibility of Fairness Definitions

no code implementations25 Nov 2019 Alice Xiang, Inioluwa Deborah Raji

Past literature has been effective in demonstrating ideological gaps in machine learning (ML) fairness definitions when considering their use in complex socio-technical systems.

Fairness

Model Cards for Model Reporting

11 code implementations5 Oct 2018 Margaret Mitchell, Simone Wu, Andrew Zaldivar, Parker Barnes, Lucy Vasserman, Ben Hutchinson, Elena Spitzer, Inioluwa Deborah Raji, Timnit Gebru

Model cards also disclose the context in which models are intended to be used, details of the performance evaluation procedures, and other relevant information.

BIG-bench Machine Learning model

Cannot find the paper you are looking for? You can Submit a new open access paper.