Search Results for author: Liu Leqi

Found 21 papers, 2 papers with code

Accounting for AI and Users Shaping One Another: The Role of Mathematical Models

no code implementations18 Apr 2024 Sarah Dean, Evan Dong, Meena Jagadeesan, Liu Leqi

As AI systems enter into a growing number of societal domains, these systems increasingly shape and are shaped by user preferences, opinions, and behaviors.

counterfactual Recommendation Systems

Personalized Language Modeling from Personalized Human Feedback

no code implementations6 Feb 2024 Xinyu Li, Zachary C. Lipton, Liu Leqi

Reinforcement Learning from Human Feedback (RLHF) is the current dominating framework to fine-tune large language models to better align with human preferences.

Language Modelling Text Summarization

Off-Policy Risk Assessment in Markov Decision Processes

no code implementations21 Sep 2022 Audrey Huang, Liu Leqi, Zachary Chase Lipton, Kamyar Azizzadenesheli

To mitigate these problems, we incorporate model-based estimation to develop the first doubly robust (DR) estimator for the CDF of returns in MDPs.

Multi-Armed Bandits

Supervised Learning with General Risk Functionals

no code implementations27 Jun 2022 Liu Leqi, Audrey Huang, Zachary C. Lipton, Kamyar Azizzadenesheli

Standard uniform convergence results bound the generalization gap of the expected loss over a hypothesis class.

A Taxonomy of Human and ML Strengths in Decision-Making to Investigate Human-ML Complementarity

no code implementations22 Apr 2022 Charvi Rastogi, Liu Leqi, Kenneth Holstein, Hoda Heidari

To illustrate how our taxonomy can be used to investigate complementarity, we provide a mathematical aggregation framework to examine enabling conditions for complementarity.

Decision Making

Modeling Attrition in Recommender Systems with Departing Bandits

no code implementations25 Mar 2022 Omer Ben-Porat, Lee Cohen, Liu Leqi, Zachary C. Lipton, Yishay Mansour

We first address the case where all users share the same type, demonstrating that a recent UCB-based algorithm is optimal.

Multi-Armed Bandits Recommendation Systems

Many Ways to Be Lonely: Fine-Grained Characterization of Loneliness and Its Potential Changes in COVID-19

no code implementations19 Jan 2022 Yueyi Jiang, Yunfan Jiang, Liu Leqi, Piotr Winkielman

To examine how different forms of loneliness and coping strategies manifest in loneliness self-disclosure, we built a dataset, FIG-Loneliness (FIne-Grained Loneliness) by using Reddit posts in two young adult-focused forums and two loneliness related forums consisting of a diverse age group.

Action-Sufficient State Representation Learning for Control with Structural Constraints

no code implementations12 Oct 2021 Biwei Huang, Chaochao Lu, Liu Leqi, José Miguel Hernández-Lobato, Clark Glymour, Bernhard Schölkopf, Kun Zhang

Perceived signals in real-world scenarios are usually high-dimensional and noisy, and finding and using their representation that contains essential and sufficient information required by downstream decision-making tasks will help improve computational efficiency and generalization ability in the tasks.

Computational Efficiency Decision Making +1

On the Convergence and Optimality of Policy Gradient for Markov Coherent Risk

no code implementations4 Mar 2021 Audrey Huang, Liu Leqi, Zachary C. Lipton, Kamyar Azizzadenesheli

Because optimizing the coherent risk is difficult in Markov decision processes, recent work tends to focus on the Markov coherent risk (MCR), a time-consistent surrogate.

Median Optimal Treatment Regimes

no code implementations2 Mar 2021 Liu Leqi, Edward H. Kennedy

In this work, we propose a new median optimal treatment regime that instead treats individuals whose conditional median is higher under treatment.

Rebounding Bandits for Modeling Satiation Effects

no code implementations NeurIPS 2021 Liu Leqi, Fatma Kilinc-Karzan, Zachary C. Lipton, Alan L. Montgomery

Psychological research shows that enjoyment of many goods is subject to satiation, with short-term satisfaction declining after repeated exposures to the same item.

Recommendation Systems

Game Design for Eliciting Distinguishable Behavior

no code implementations NeurIPS 2019 Fan Yang, Liu Leqi, Yifan Wu, Zachary C. Lipton, Pradeep Ravikumar, William W. Cohen, Tom Mitchell

The ability to inferring latent psychological traits from human behavior is key to developing personalized human-interacting machine learning systems.

Automated Dependence Plots

2 code implementations2 Dec 2019 David I. Inouye, Liu Leqi, Joon Sik Kim, Bryon Aragam, Pradeep Ravikumar

To address these drawbacks, we formalize a method for automating the selection of interesting PDPs and extend PDPs beyond showing single features to show the model response along arbitrary directions, for example in raw feature space or a latent space arising from some generative model.

Model Selection Selection bias

On Human-Aligned Risk Minimization

no code implementations NeurIPS 2019 Liu Leqi, Adarsh Prasad, Pradeep K. Ravikumar

The statistical decision theoretic foundations of modern machine learning have largely focused on the minimization of the expectation of some loss function for a given task.

Decision Making Fairness

The Sample Complexity of Semi-Supervised Learning with Nonparametric Mixture Models

no code implementations NeurIPS 2018 Chen Dan, Liu Leqi, Bryon Aragam, Pradeep K. Ravikumar, Eric P. Xing

We study the sample complexity of semi-supervised learning (SSL) and introduce new assumptions based on the mismatch between a mixture model learned from unlabeled data and the true mixture model induced by the (unknown) class conditional distributions.

Binary Classification Classification +2

Sample Complexity of Nonparametric Semi-Supervised Learning

no code implementations NeurIPS 2018 Chen Dan, Liu Leqi, Bryon Aragam, Pradeep Ravikumar, Eric P. Xing

We study the sample complexity of semi-supervised learning (SSL) and introduce new assumptions based on the mismatch between a mixture model learned from unlabeled data and the true mixture model induced by the (unknown) class conditional distributions.

Binary Classification Classification +2

Cannot find the paper you are looking for? You can Submit a new open access paper.