Search Results for author: Haitong Ma

Found 11 papers, 8 papers with code

Efficient Duple Perturbation Robustness in Low-rank MDPs

no code implementations11 Apr 2024 Yang Hu, Haitong Ma, Bo Dai, Na Li

The pursuit of robustness has recently been a popular topic in reinforcement learning (RL) research, yet the existing methods generally suffer from efficiency issues that obstruct their real-world implementation.

Reinforcement Learning (RL)

Skill Transfer and Discovery for Sim-to-Real Learning: A Representation-Based Viewpoint

no code implementations7 Apr 2024 Haitong Ma, Zhaolin Ren, Bo Dai, Na Li

Moreover, to handle the sim-to-real gap in the dynamics, we propose a skill discovery algorithm that learns new skills caused by the sim-to-real gap from real-world data.

Representation Learning

Stochastic Nonlinear Control via Finite-dimensional Spectral Dynamic Embedding

no code implementations8 Apr 2023 Tongzheng Ren, Zhaolin Ren, Haitong Ma, Na Li, Bo Dai

This paper presents an approach, Spectral Dynamics Embedding Control (SDEC), to optimal control for nonlinear stochastic systems.

Gaussian Max-Value Entropy Search for Multi-Agent Bayesian Optimization

1 code implementation10 Mar 2023 Haitong Ma, Tianpeng Zhang, Yixuan Wu, Flavio P. Calmon, Na Li

We focus on Entropy Search (ES), a sample-efficient BO algorithm that selects queries to maximize the mutual information about the maximum of the black-box function.

Bayesian Optimization Computational Efficiency

Reachability Constrained Reinforcement Learning

2 code implementations16 May 2022 Dongjie Yu, Haitong Ma, Shengbo Eben Li, Jianyu Chen

Recent studies incorporate feasible sets into CRL with energy-based methods such as control barrier function (CBF), safety index (SI), and leverage prior conservative estimations of feasible sets, which harms the performance of the learned policy.

reinforcement-learning Reinforcement Learning (RL) +1

Learn Zero-Constraint-Violation Policy in Model-Free Constrained Reinforcement Learning

1 code implementation25 Nov 2021 Haitong Ma, Changliu Liu, Shengbo Eben Li, Sifa Zheng, Wenchao Sun, Jianyu Chen

Existing methods mostly use the posterior penalty for dangerous actions, which means that the agent is not penalized until experiencing danger.

reinforcement-learning Reinforcement Learning (RL)

Joint Synthesis of Safety Certificate and Safe Control Policy using Constrained Reinforcement Learning

1 code implementation15 Nov 2021 Haitong Ma, Changliu Liu, Shengbo Eben Li, Sifa Zheng, Jianyu Chen

This paper proposes a novel approach that simultaneously synthesizes the energy-function-based safety certificate and learns the safe control policy with CRL.

reinforcement-learning Reinforcement Learning (RL) +1

Feasible Actor-Critic: Constrained Reinforcement Learning for Ensuring Statewise Safety

3 code implementations22 May 2021 Haitong Ma, Yang Guan, Shegnbo Eben Li, Xiangteng Zhang, Sifa Zheng, Jianyu Chen

The safety constraints commonly used by existing safe reinforcement learning (RL) methods are defined only on expectation of initial states, but allow each certain state to be unsafe, which is unsatisfying for real-world safety-critical tasks.

reinforcement-learning Reinforcement Learning (RL) +2

Integrated Decision and Control: Towards Interpretable and Computationally Efficient Driving Intelligence

2 code implementations18 Mar 2021 Yang Guan, Yangang Ren, Qi Sun, Shengbo Eben Li, Haitong Ma, Jingliang Duan, Yifan Dai, Bo Cheng

In this paper, we present an interpretable and computationally efficient framework called integrated decision and control (IDC) for automated vehicles, which decomposes the driving task into static path planning and dynamic optimal tracking that are structured hierarchically.

Autonomous Driving Model-based Reinforcement Learning +2

Model-based Constrained Reinforcement Learning using Generalized Control Barrier Function

1 code implementation2 Mar 2021 Haitong Ma, Jianyu Chen, Shengbo Eben Li, Ziyu Lin, Yang Guan, Yangang Ren, Sifa Zheng

Model information can be used to predict future trajectories, so it has huge potential to avoid dangerous region when implementing reinforcement learning (RL) on real-world tasks, like autonomous driving.

Autonomous Driving Collision Avoidance +3

Cannot find the paper you are looking for? You can Submit a new open access paper.