Search Results for author: Jiayu Yao

Found 14 papers, 3 papers with code

Inverse Reinforcement Learning with Multiple Planning Horizons

no code implementations26 Sep 2024 Jiayu Yao, Weiwei Pan, Finale Doshi-Velez, Barbara E Engelhardt

In this work, we study an inverse reinforcement learning (IRL) problem where the experts are planning under a shared reward function but with different, unknown planning horizons.

reinforcement-learning Reinforcement Learning

Contextual Conservative Q-Learning for Offline Reinforcement Learning

no code implementations3 Jan 2023 Ke Jiang, Jiayu Yao, Xiaoyang Tan

In this paper, we propose Contextual Conservative Q-Learning(C-CQL) to learn a robustly reliable policy through the contextual information captured via an inverse dynamics model.

Q-Learning reinforcement-learning +2

Deep Semi-supervised Learning with Double-Contrast of Features and Semantics

no code implementations28 Nov 2022 Quan Feng, Jiayu Yao, Zhison Pan, Guojun Zhou

Therefore, a more realistic strategy is to leverage semi-supervised learning (SSL) with a small amount of labeled data and a large amount of unlabeled data.

Representation Learning

An Empirical Analysis of the Advantages of Finite- v.s. Infinite-Width Bayesian Neural Networks

no code implementations16 Nov 2022 Jiayu Yao, Yaniv Yacoby, Beau Coker, Weiwei Pan, Finale Doshi-Velez

Comparing Bayesian neural networks (BNNs) with different widths is challenging because, as the width increases, multiple model properties change simultaneously, and, inference in the finite-width case is intractable.

Success of Uncertainty-Aware Deep Models Depends on Data Manifold Geometry

no code implementations2 Aug 2022 Mark Penrod, Harrison Termotto, Varshini Reddy, Jiayu Yao, Finale Doshi-Velez, Weiwei Pan

For responsible decision making in safety-critical settings, machine learning models must effectively detect and process edge-case data.

Decision Making Deep Learning

Policy Optimization with Sparse Global Contrastive Explanations

no code implementations13 Jul 2022 Jiayu Yao, Sonali Parbhoo, Weiwei Pan, Finale Doshi-Velez

We develop a Reinforcement Learning (RL) framework for improving an existing behavior policy via sparse, user-interpretable changes.

reinforcement-learning Reinforcement Learning +1

Learning Downstream Task by Selectively Capturing Complementary Knowledge from Multiple Self-supervisedly Learning Pretexts

no code implementations11 Apr 2022 Jiayu Yao, Qingyuan Wu, Quan Feng, Songcan Chen

Self-supervised learning (SSL), as a newly emerging unsupervised representation learning paradigm, generally follows a two-stage learning pipeline: 1) learning invariant and discriminative representations with auto-annotation pretext(s), then 2) transferring the representations to assist downstream task(s).

Representation Learning Self-Supervised Learning

Compositional Q-learning for electrolyte repletion with imbalanced patient sub-populations

no code implementations6 Oct 2021 Aishwarya Mandyam, Andrew Jones, Jiayu Yao, Krzysztof Laudanski, Barbara Engelhardt

CFQI uses a compositional $Q$-value function with separate modules for each task variant, allowing it to take advantage of shared knowledge while learning distinct policies for each variant.

Decision Making Navigate +5

Power Constrained Bandits

1 code implementation13 Apr 2020 Jiayu Yao, Emma Brunskill, Weiwei Pan, Susan Murphy, Finale Doshi-Velez

However, when bandits are deployed in the context of a scientific study -- e. g. a clinical trial to test if a mobile health intervention is effective -- the aim is not only to personalize for an individual, but also to determine, with sufficient statistical power, whether or not the system's intervention is effective.

Decision Making Multi-Armed Bandits

Output-Constrained Bayesian Neural Networks

1 code implementation15 May 2019 Wanqian Yang, Lars Lorch, Moritz A. Graule, Srivatsan Srinivasan, Anirudh Suresh, Jiayu Yao, Melanie F. Pradier, Finale Doshi-Velez

Bayesian neural network (BNN) priors are defined in parameter space, making it hard to encode prior knowledge expressed in function space.

Projected BNNs: Avoiding weight-space pathologies by learning latent representations of neural network weights

no code implementations16 Nov 2018 Melanie F. Pradier, Weiwei Pan, Jiayu Yao, Soumya Ghosh, Finale Doshi-Velez

As machine learning systems get widely adopted for high-stake decisions, quantifying uncertainty over predictions becomes crucial.

Variational Inference

Structured Variational Learning of Bayesian Neural Networks with Horseshoe Priors

2 code implementations ICML 2018 Soumya Ghosh, Jiayu Yao, Finale Doshi-Velez

Bayesian Neural Networks (BNNs) have recently received increasing attention for their ability to provide well-calibrated posterior uncertainties.

Model Selection Open-Ended Question Answering +3

Cannot find the paper you are looking for? You can Submit a new open access paper.