Search Results for author: Yanan Sui

Found 22 papers, 6 papers with code

DynSyn: Dynamical Synergistic Representation for Efficient Learning and Control in Overactuated Embodied Systems

no code implementations 16 Jul 2024 Kaibo He, Chenhui Zuo, Chengtian Ma, Yanan Sui

The study of these control mechanisms will provide insights into the control of high-dimensional, overactuated systems.

Distributionally Robust Constrained Reinforcement Learning under Strong Duality

no code implementations 22 Jun 2024 Zhengfei Zhang, Kishan Panaganti, Laixi Shi, Yanan Sui, Adam Wierman, Yisong Yue

We study the problem of Distributionally Robust Constrained RL (DRC-RL), where the goal is to maximize the expected reward subject to environmental distribution shifts and constraints.

Car Racing reinforcement-learning

Improving sample efficiency of high dimensional Bayesian optimization with MCMC

no code implementations 5 Jan 2024 Zeji Yi, Yunyue Wei, Chu Xin Cheng, Kaibo He, Yanan Sui

Sequential optimization methods are often confronted with the curse of dimensionality in high-dimensional spaces.

Bayesian Optimization Thompson Sampling

An Invariant Information Geometric Method for High-Dimensional Online Optimization

1 code implementation 3 Jan 2024 Zhengfei Zhang, Yunyue Wei, Yanan Sui

In this paper, we introduce a full invariance oriented evolution strategies algorithm, derived from its corresponding framework, that effectively rivals the leading Bayesian optimization method in tasks with dimensions at the upper limit of Bayesian capability.

Bayesian Optimization

Self Model for Embodied Intelligence: Modeling Full-Body Human Musculoskeletal System and Locomotion Control with Hierarchical Low-Dimensional Representation

no code implementations 9 Dec 2023 Chenhui Zuo, Kaibo He, Jing Shao, Yanan Sui

Modeling and control of the human musculoskeletal system is important for understanding human motor functions, developing embodied intelligence, and optimizing human-robot interaction systems.

Automatic Sleep Stage Classification with Cross-modal Self-supervised Features from Deep Brain Signals

no code implementations 7 Feb 2023 Chen Gong, Yue Chen, Yanan Sui, Luming Li

This sleep stage classification model could be adapted for chronic, continuous sleep monitoring of Parkinson's patients in daily life, and potentially used for more precise treatment in deep brain-machine interfaces, such as closed-loop deep brain stimulation.

Automatic Sleep Stage Classification Classification +1

Confidence-Aware Imitation Learning from Demonstrations with Varying Optimality

2 code implementations NeurIPS 2021 Songyuan Zhang, Zhangjie Cao, Dorsa Sadigh, Yanan Sui

Our results show that CAIL significantly outperforms other imitation learning methods from demonstrations with varying optimality.

Imitation Learning

Imitation with Neural Density Models

no code implementations NeurIPS 2021 Kuno Kim, Akshat Jindal, Yang Song, Jiaming Song, Yanan Sui, Stefano Ermon

We propose a new framework for Imitation Learning (IL) via density estimation of the expert's occupancy measure followed by Maximum Occupancy Entropy Reinforcement Learning (RL) using the density as a reward.

Density Estimation Imitation Learning +2

Safe Reinforcement Learning in Constrained Markov Decision Processes

1 code implementation ICML 2020 Akifumi Wachi, Yanan Sui

Safe reinforcement learning has been a promising approach for optimizing the policy of an agent that operates in safety-critical applications.

reinforcement-learning Reinforcement Learning (RL) +1

Dueling Posterior Sampling for Preference-Based Reinforcement Learning

1 code implementation 4 Aug 2019 Ellen R. Novoseller, Yibing Wei, Yanan Sui, Yisong Yue, Joel W. Burdick

In preference-based reinforcement learning (RL), an agent interacts with the environment while receiving preferences instead of absolute feedback.

reinforcement-learning Reinforcement Learning (RL)

D3TW: Discriminative Differentiable Dynamic Time Warping for Weakly Supervised Action Alignment and Segmentation

no code implementations CVPR 2019 Chien-Yi Chang, De-An Huang, Yanan Sui, Li Fei-Fei, Juan Carlos Niebles

The key technical challenge for discriminative modeling with weak supervision is that the loss function of the ordering supervision is usually formulated using dynamic programming and is thus not differentiable.

Dynamic Time Warping Segmentation +1

Stagewise Safe Bayesian Optimization with Gaussian Processes

no code implementations ICML 2018 Yanan Sui, Vincent Zhuang, Joel W. Burdick, Yisong Yue

We provide theoretical guarantees for both the satisfaction of safety constraints and convergence to the optimal utility value.

Bayesian Optimization Decision Making +2

Quantifying Performance of Bipedal Standing with Multi-channel EMG

no code implementations 21 Nov 2017 Yanan Sui, Kun Ho Kim, Joel W. Burdick

Spinal cord stimulation has enabled humans with motor complete spinal cord injury (SCI) to independently stand and recover some lost autonomic function.

Electromyography (EMG)

Bellman Gradient Iteration for Inverse Reinforcement Learning

no code implementations 24 Jul 2017 Kun Li, Yanan Sui, Joel W. Burdick

We introduce a strategy to flexibly handle different types of actions with two approximations of the Bellman Optimality Equation, and a Bellman Gradient Iteration method to compute the gradient of the Q-value with respect to the reward function.

reinforcement-learning Reinforcement Learning (RL)

Multi-dueling Bandits with Dependent Arms

no code implementations 29 Apr 2017 Yanan Sui, Vincent Zhuang, Joel W. Burdick, Yisong Yue

The dueling bandits problem is an online learning framework for learning from pairwise preference feedback, and is particularly well-suited for modeling settings that elicit subjective or implicit human feedback.

Thompson Sampling
