Search Results for author: Yanan Sui

Found 20 papers, 6 papers with code

Safe Reinforcement Learning in Constrained Markov Decision Processes

1 code implementation • ICML 2020 • Akifumi Wachi, Yanan Sui

Safe reinforcement learning has been a promising approach for optimizing the policy of an agent that operates in safety-critical applications.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Code

Confidence-Aware Imitation Learning from Demonstrations with Varying Optimality

2 code implementations • NeurIPS 2021 • Songyuan Zhang, Zhangjie Cao, Dorsa Sadigh, Yanan Sui

Our results show that CAIL significantly outperforms other imitation learning methods from demonstrations with varying optimality.

Imitation Learning

Paper
Code

Safe Policy Optimization with Local Generalized Linear Function Approximations

1 code implementation • NeurIPS 2021 • Akifumi Wachi, Yunyue Wei, Yanan Sui

Safe exploration is a key to applying reinforcement learning (RL) in safety-critical systems.

Reinforcement Learning (RL) Safe Exploration

Paper
Code

Dueling Posterior Sampling for Preference-Based Reinforcement Learning

1 code implementation • 4 Aug 2019 • Ellen R. Novoseller, Yibing Wei, Yanan Sui, Yisong Yue, Joel W. Burdick

In preference-based reinforcement learning (RL), an agent interacts with the environment while receiving preferences instead of absolute feedback.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

An Invariant Information Geometric Method for High-Dimensional Online Optimization

1 code implementation • 3 Jan 2024 • Zhengfei Zhang, Yunyue Wei, Yanan Sui

In this paper, we introduce a full invariance oriented evolution strategies algorithm, derived from its corresponding framework, that effectively rivals the leading Bayesian optimization method in tasks with dimensions at the upper limit of Bayesian capability.

Bayesian Optimization

Paper
Code

Quantifying Performance of Bipedal Standing with Multi-channel EMG

no code implementations • 21 Nov 2017 • Yanan Sui, Kun Ho Kim, Joel W. Burdick

Spinal cord stimulation has enabled humans with motor complete spinal cord injury (SCI) to independently stand and recover some lost autonomic function.

Electromyography (EMG)

Paper
Add Code

Bellman Gradient Iteration for Inverse Reinforcement Learning

no code implementations • 24 Jul 2017 • Kun Li, Yanan Sui, Joel W. Burdick

We introduce a strategy to flexibly handle different types of actions with two approximations of the Bellman Optimality Equation, and a Bellman Gradient Iteration method to compute the gradient of the Q-value with respect to the reward function.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Correlational Dueling Bandits with Application to Clinical Treatment in Large Decision Spaces

no code implementations • 8 Jul 2017 • Yanan Sui, Yisong Yue, Joel W. Burdick

This problem can be formulated as a $K$-armed Dueling Bandits problem where $K$ is the total number of decisions.

Decision Making Decision Making Under Uncertainty

Paper
Add Code

Multi-dueling Bandits with Dependent Arms

no code implementations • 29 Apr 2017 • Yanan Sui, Vincent Zhuang, Joel W. Burdick, Yisong Yue

The dueling bandits problem is an online learning framework for learning from pairwise preference feedback, and is particularly well-suited for modeling settings that elicit subjective or implicit human feedback.

Thompson Sampling

Paper
Add Code

Stagewise Safe Bayesian Optimization with Gaussian Processes

no code implementations • ICML 2018 • Yanan Sui, Vincent Zhuang, Joel W. Burdick, Yisong Yue

We provide theoretical guarantees for both the satisfaction of safety constraints as well as convergence to the optimal utility value.

Bayesian Optimization Decision Making +2

Paper
Add Code

D3TW: Discriminative Differentiable Dynamic Time Warping for Weakly Supervised Action Alignment and Segmentation

no code implementations • CVPR 2019 • Chien-Yi Chang, De-An Huang, Yanan Sui, Li Fei-Fei, Juan Carlos Niebles

The key technical challenge for discriminative modeling with weak supervision is that the loss function of the ordering supervision is usually formulated using dynamic programming and is thus not differentiable.

Ranked #5 on Weakly Supervised Action Segmentation (Transcript) on Breakfast

Dynamic Time Warping Segmentation +1

Paper
Add Code

Deepfakes for Medical Video De-Identification: Privacy Protection and Diagnostic Information Preservation

no code implementations • 7 Feb 2020 • Bingquan Zhu, Hao Fang, Yanan Sui, Luming Li

Data sharing for medical research has been difficult as open-sourcing clinical data may violate patient privacy.

De-identification Face Swapping +1

Paper
Add Code

Imitation with Neural Density Models

no code implementations • NeurIPS 2021 • Kuno Kim, Akshat Jindal, Yang song, Jiaming Song, Yanan Sui, Stefano Ermon

We propose a new framework for Imitation Learning (IL) via density estimation of the expert's occupancy measure followed by Maximum Occupancy Entropy Reinforcement Learning (RL) using the density as a reward.

Density Estimation Imitation Learning +2

Paper
Add Code

ROIAL: Region of Interest Active Learning for Characterizing Exoskeleton Gait Preference Landscapes

1 code implementation • 9 Nov 2020 • Kejun Li, Maegan Tucker, Erdem Biyik, Ellen Novoseller, Joel W. Burdick, Yanan Sui, Dorsa Sadigh, Yisong Yue, Aaron D. Ames

ROIAL learns Bayesian posteriors that predict each exoskeleton user's utility landscape across four exoskeleton gait parameters.

Active Learning

Paper
Code

No-Regret Reinforcement Learning with Heavy-Tailed Rewards

no code implementations • 25 Feb 2021 • Vincent Zhuang, Yanan Sui

We consider such scenarios in the setting of undiscounted reinforcement learning.

Q-Learning reinforcement-learning +1

Paper
Add Code

Parkinsonian Chinese Speech Analysis towards Automatic Classification of Parkinson's Disease

no code implementations • 31 May 2021 • Hao Fang, Chen Gong, Chen Zhang, Yanan Sui, Luming Li

Speech disorders often occur at the early stage of Parkinson's disease (PD).

Classification feature selection

Paper
Add Code

Automatic Sleep Stage Classification with Cross-modal Self-supervised Features from Deep Brain Signals

no code implementations • 7 Feb 2023 • Chen Gong, Yue Chen, Yanan Sui, Luming Li

This sleep stage classification model could be adapted to chronic and continuous monitor sleep for Parkinson's patients in daily life, and potentially utilized for more precise treatment in deep brain-machine interfaces, such as closed-loop deep brain stimulation.

Automatic Sleep Stage Classification Classification +1

Paper
Add Code

Self Model for Embodied Intelligence: Modeling Full-Body Human Musculoskeletal System and Locomotion Control with Hierarchical Low-Dimensional Representation

no code implementations • 9 Dec 2023 • Kaibo He, Chenhui Zuo, Jing Shao, Yanan Sui

Modeling and control of the human musculoskeletal system is important for understanding human motor functions, developing embodied intelligence, and optimizing human-robot interaction systems.

Paper
Add Code

Improving sample efficiency of high dimensional Bayesian optimization with MCMC

no code implementations • 5 Jan 2024 • Zeji Yi, Yunyue Wei, Chu Xin Cheng, Kaibo He, Yanan Sui

Sequential optimization methods are often confronted with the curse of dimensionality in high-dimensional spaces.

Bayesian Optimization Thompson Sampling

Paper
Add Code

A Survey of Constraint Formulations in Safe Reinforcement Learning

no code implementations • 3 Feb 2024 • Akifumi Wachi, Xun Shen, Yanan Sui

Ensuring safety is critical when applying reinforcement learning (RL) to real-world problems.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.