no code implementations • 11 Apr 2024 • Yang Hu, Haitong Ma, Bo Dai, Na Li
The pursuit of robustness has recently been a popular topic in reinforcement learning (RL) research, yet the existing methods generally suffer from efficiency issues that obstruct their real-world implementation.
no code implementations • 7 Apr 2024 • Haitong Ma, Zhaolin Ren, Bo Dai, Na Li
Moreover, to handle the sim-to-real gap in the dynamics, we propose a skill discovery algorithm that learns new skills caused by the sim-to-real gap from real-world data.
no code implementations • 8 Apr 2023 • Tongzheng Ren, Zhaolin Ren, Haitong Ma, Na Li, Bo Dai
This paper presents an approach, Spectral Dynamics Embedding Control (SDEC), to optimal control for nonlinear stochastic systems.
1 code implementation • 10 Mar 2023 • Haitong Ma, Tianpeng Zhang, Yixuan Wu, Flavio P. Calmon, Na Li
We focus on Entropy Search (ES), a sample-efficient BO algorithm that selects queries to maximize the mutual information about the maximum of the black-box function.
1 code implementation • 14 Oct 2022 • Dongjie Yu, Wenjun Zou, Yujie Yang, Haitong Ma, Shengbo Eben Li, Jingliang Duan, Jianyu Chen
Furthermore, we build a safe RL framework to resolve constraints required by the DRC and its corresponding shield policy.
Model-based Reinforcement Learning reinforcement-learning +2
2 code implementations • 16 May 2022 • Dongjie Yu, Haitong Ma, Shengbo Eben Li, Jianyu Chen
Recent studies incorporate feasible sets into CRL with energy-based methods such as control barrier function (CBF), safety index (SI), and leverage prior conservative estimations of feasible sets, which harms the performance of the learned policy.
1 code implementation • 25 Nov 2021 • Haitong Ma, Changliu Liu, Shengbo Eben Li, Sifa Zheng, Wenchao Sun, Jianyu Chen
Existing methods mostly use the posterior penalty for dangerous actions, which means that the agent is not penalized until experiencing danger.
1 code implementation • 15 Nov 2021 • Haitong Ma, Changliu Liu, Shengbo Eben Li, Sifa Zheng, Jianyu Chen
This paper proposes a novel approach that simultaneously synthesizes the energy-function-based safety certificate and learns the safe control policy with CRL.
3 code implementations • 22 May 2021 • Haitong Ma, Yang Guan, Shegnbo Eben Li, Xiangteng Zhang, Sifa Zheng, Jianyu Chen
The safety constraints commonly used by existing safe reinforcement learning (RL) methods are defined only on expectation of initial states, but allow each certain state to be unsafe, which is unsatisfying for real-world safety-critical tasks.
2 code implementations • 18 Mar 2021 • Yang Guan, Yangang Ren, Qi Sun, Shengbo Eben Li, Haitong Ma, Jingliang Duan, Yifan Dai, Bo Cheng
In this paper, we present an interpretable and computationally efficient framework called integrated decision and control (IDC) for automated vehicles, which decomposes the driving task into static path planning and dynamic optimal tracking that are structured hierarchically.
1 code implementation • 2 Mar 2021 • Haitong Ma, Jianyu Chen, Shengbo Eben Li, Ziyu Lin, Yang Guan, Yangang Ren, Sifa Zheng
Model information can be used to predict future trajectories, so it has huge potential to avoid dangerous region when implementing reinforcement learning (RL) on real-world tasks, like autonomous driving.