no code implementations • 26 Sep 2024 • Jiayu Yao, Weiwei Pan, Finale Doshi-Velez, Barbara E Engelhardt
In this work, we study an inverse reinforcement learning (IRL) problem where the experts are planning under a shared reward function but with different, unknown planning horizons.
no code implementations • 3 Jan 2023 • Ke Jiang, Jiayu Yao, Xiaoyang Tan
In this paper, we propose Contextual Conservative Q-Learning(C-CQL) to learn a robustly reliable policy through the contextual information captured via an inverse dynamics model.
no code implementations • 28 Nov 2022 • Quan Feng, Jiayu Yao, Zhison Pan, Guojun Zhou
Therefore, a more realistic strategy is to leverage semi-supervised learning (SSL) with a small amount of labeled data and a large amount of unlabeled data.
no code implementations • 16 Nov 2022 • Jiayu Yao, Yaniv Yacoby, Beau Coker, Weiwei Pan, Finale Doshi-Velez
Comparing Bayesian neural networks (BNNs) with different widths is challenging because, as the width increases, multiple model properties change simultaneously, and, inference in the finite-width case is intractable.
no code implementations • 2 Aug 2022 • Mark Penrod, Harrison Termotto, Varshini Reddy, Jiayu Yao, Finale Doshi-Velez, Weiwei Pan
For responsible decision making in safety-critical settings, machine learning models must effectively detect and process edge-case data.
no code implementations • 13 Jul 2022 • Jiayu Yao, Sonali Parbhoo, Weiwei Pan, Finale Doshi-Velez
We develop a Reinforcement Learning (RL) framework for improving an existing behavior policy via sparse, user-interpretable changes.
no code implementations • 11 Apr 2022 • Jiayu Yao, Qingyuan Wu, Quan Feng, Songcan Chen
Self-supervised learning (SSL), as a newly emerging unsupervised representation learning paradigm, generally follows a two-stage learning pipeline: 1) learning invariant and discriminative representations with auto-annotation pretext(s), then 2) transferring the representations to assist downstream task(s).
no code implementations • 6 Oct 2021 • Aishwarya Mandyam, Andrew Jones, Jiayu Yao, Krzysztof Laudanski, Barbara Engelhardt
CFQI uses a compositional $Q$-value function with separate modules for each task variant, allowing it to take advantage of shared knowledge while learning distinct policies for each variant.
1 code implementation • 13 Apr 2020 • Jiayu Yao, Emma Brunskill, Weiwei Pan, Susan Murphy, Finale Doshi-Velez
However, when bandits are deployed in the context of a scientific study -- e. g. a clinical trial to test if a mobile health intervention is effective -- the aim is not only to personalize for an individual, but also to determine, with sufficient statistical power, whether or not the system's intervention is effective.
no code implementations • 24 Jun 2019 • Jiayu Yao, Weiwei Pan, Soumya Ghosh, Finale Doshi-Velez
Bayesian Neural Networks (BNNs) place priors over the parameters in a neural network.
1 code implementation • 15 May 2019 • Wanqian Yang, Lars Lorch, Moritz A. Graule, Srivatsan Srinivasan, Anirudh Suresh, Jiayu Yao, Melanie F. Pradier, Finale Doshi-Velez
Bayesian neural network (BNN) priors are defined in parameter space, making it hard to encode prior knowledge expressed in function space.
no code implementations • 16 Nov 2018 • Melanie F. Pradier, Weiwei Pan, Jiayu Yao, Soumya Ghosh, Finale Doshi-Velez
As machine learning systems get widely adopted for high-stake decisions, quantifying uncertainty over predictions becomes crucial.
2 code implementations • ICML 2018 • Soumya Ghosh, Jiayu Yao, Finale Doshi-Velez
Bayesian Neural Networks (BNNs) have recently received increasing attention for their ability to provide well-calibrated posterior uncertainties.
no code implementations • 31 May 2018 • Omer Gottesman, Fredrik Johansson, Joshua Meier, Jack Dent, Dong-hun Lee, Srivatsan Srinivasan, Linying Zhang, Yi Ding, David Wihl, Xuefeng Peng, Jiayu Yao, Isaac Lage, Christopher Mosch, Li-wei H. Lehman, Matthieu Komorowski, Aldo Faisal, Leo Anthony Celi, David Sontag, Finale Doshi-Velez
Much attention has been devoted recently to the development of machine learning algorithms with the goal of improving treatment policies in healthcare.