Search Results for author: Emma Pierson

Found 22 papers, 13 papers with code

LLMs generate structurally realistic social networks but overestimate political homophily

1 code implementation29 Aug 2024 Serina Chang, Alicja Chaszczewicz, Emma Wang, Maya Josifovska, Emma Pierson, Jure Leskovec

To answer these questions, we develop three prompting methods for network generation and compare the generated networks to real social networks.

Annotation alignment: Comparing LLM and human annotations of conversational safety

no code implementations10 Jun 2024 Rajiv Movva, Pang Wei Koh, Emma Pierson

We study this question via *annotation alignment*, the extent to which LLMs and humans agree when annotating the safety of user-chatbot conversations.

Chatbot

MEDIQ: Question-Asking LLMs for Adaptive and Reliable Clinical Reasoning

1 code implementation3 Jun 2024 Shuyue Stella Li, Vidhisha Balachandran, Shangbin Feng, Jonathan Ilgen, Emma Pierson, Pang Wei Koh, Yulia Tsvetkov

We develop a reliable Patient system and prototype several Expert systems, first showing that directly prompting state-of-the-art LLMs to ask questions degrades the quality of clinical reasoning, indicating that adapting LLMs to interactive information-seeking settings is nontrivial.

Question Answering

Participation in the age of foundation models

no code implementations29 May 2024 Harini Suresh, Emily Tseng, Meg Young, Mary L. Gray, Emma Pierson, Karen Levy

In addition to the "foundation" layer, our framework proposes the "subfloor'' layer, in which stakeholders develop shared technical infrastructure, norms and governance for a grounded domain, and the "surface'' layer, in which affected communities shape the use of a foundation model for a specific downstream task.

A Bayesian Spatial Model to Correct Under-Reporting in Urban Crowdsourcing

1 code implementation18 Dec 2023 Gabriel Agostini, Emma Pierson, Nikhil Garg

Decision-makers often observe the occurrence of events through a reporting process.

Domain constraints improve risk prediction when outcome data is missing

no code implementations6 Dec 2023 Sidhika Balachandar, Nikhil Garg, Emma Pierson

Though our case study is in healthcare, our analysis reveals a general class of domain constraints which can improve model estimation in many settings.

Decision Making

Reconciling the accuracy-diversity trade-off in recommendations

no code implementations27 Jul 2023 Kenny Peng, Manish Raghavan, Emma Pierson, Jon Kleinberg, Nikhil Garg

In recommendation settings, there is an apparent trade-off between the goals of accuracy (to recommend items a user is most likely to want) and diversity (to recommend items representing a range of categories).

Diversity Recommendation Systems

Topics, Authors, and Institutions in Large Language Model Research: Trends from 17K arXiv Papers

1 code implementation20 Jul 2023 Rajiv Movva, Sidhika Balachandar, Kenny Peng, Gabriel Agostini, Nikhil Garg, Emma Pierson

Large language models (LLMs) are dramatically influencing AI research, spurring discussions on what has changed so far and how to shape the field's future.

Language Modelling Large Language Model

Choosing the Right Weights: Balancing Value, Strategy, and Noise in Recommender Systems

no code implementations27 May 2023 Smitha Milli, Emma Pierson, Nikhil Garg

Many recommender systems are based on optimizing a linear weighting of different user behaviors, such as clicks, likes, shares, etc.

Recommendation Systems

Coarse race data conceals disparities in clinical risk score performance

1 code implementation18 Apr 2023 Rajiv Movva, Divya Shanmugam, Kaihua Hou, Priya Pathak, John Guttag, Nikhil Garg, Emma Pierson

Across outcomes and metrics, we show that the risk scores exhibit significant granular performance disparities within coarse race groups.

How do Authors' Perceptions of their Papers Compare with Co-authors' Perceptions and Peer-review Decisions?

no code implementations22 Nov 2022 Charvi Rastogi, Ivan Stelmakh, Alina Beygelzimer, Yann N. Dauphin, Percy Liang, Jennifer Wortman Vaughan, Zhenyu Xue, Hal Daumé III, Emma Pierson, Nihar B. Shah

In a top-tier computer science conference (NeurIPS 2021) with more than 23, 000 submitting authors and 9, 000 submitted papers, we survey the authors on three questions: (i) their predicted probability of acceptance for each of their papers, (ii) their perceived ranking of their own papers based on scientific contribution, and (iii) the change in their perception about their own papers after seeing the reviews.

Quantifying disparities in intimate partner violence: a machine learning method to correct for underreporting

1 code implementation8 Oct 2021 Divya Shanmugam, Kaihua Hou, Emma Pierson

Estimating the prevalence of a medical condition, or the proportion of the population in which it occurs, is a fundamental problem in healthcare and public health.

Ethical Machine Learning in Health Care

no code implementations22 Sep 2020 Irene Y. Chen, Emma Pierson, Sherri Rose, Shalmali Joshi, Kadija Ferryman, Marzyeh Ghassemi

The use of machine learning (ML) in health care raises numerous ethical concerns, especially as models can amplify existing health inequities.

BIG-bench Machine Learning Ethics

Concept Bottleneck Models

4 code implementations ICML 2020 Pang Wei Koh, Thao Nguyen, Yew Siang Tang, Stephen Mussmann, Emma Pierson, Been Kim, Percy Liang

We seek to learn models that we can interact with using high-level concepts: if the model did not think there was a bone spur in the x-ray, would it still predict severe arthritis?

Inferring Multidimensional Rates of Aging from Cross-Sectional Data

1 code implementation12 Jul 2018 Emma Pierson, Pang Wei Koh, Tatsunori Hashimoto, Daphne Koller, Jure Leskovec, Nicholas Eriksson, Percy Liang

Motivated by the study of human aging, we present an interpretable latent-variable model that learns temporal dynamics from cross-sectional data.

Human Aging Time Series +1

A large-scale analysis of racial disparities in police stops across the United States

3 code implementations18 Jun 2017 Emma Pierson, Camelia Simoiu, Jan Overgoor, Sam Corbett-Davies, Vignesh Ramachandran, Cheryl Phillips, Sharad Goel

We find that black drivers are stopped more often than white drivers relative to their share of the driving-age population, but that Hispanic drivers are stopped less often than whites.

Applications

SIMLR: A Tool for Large-Scale Genomic Analyses by Multi-Kernel Learning

1 code implementation21 Mar 2017 Bo Wang, Daniele Ramazzotti, Luca De Sano, Junjie Zhu, Emma Pierson, Serafim Batzoglou

We here present SIMLR (Single-cell Interpretation via Multi-kernel LeaRning), an open-source tool that implements a novel framework to learn a sample-to-sample similarity measure from expression data observed for heterogenous samples.

Clustering Dimensionality Reduction

Fast Threshold Tests for Detecting Discrimination

1 code implementation27 Feb 2017 Emma Pierson, Sam Corbett-Davies, Sharad Goel

Threshold tests have recently been proposed as a useful method for detecting bias in lending, hiring, and policing decisions.

Cannot find the paper you are looking for? You can Submit a new open access paper.