no code implementations • 23 Dec 2024 • Yizhe Li, Linrui Zhang, Xueqian Wang, Houde Liu, Bin Liang
Safety-critical traffic scenarios are of great practical relevance to evaluating the robustness of autonomous driving (AD) systems.
no code implementations • 19 Oct 2023 • Linrui Zhang, Zhenghao Peng, Quanyi Li, Bolei Zhou
Driving safety is a top priority for autonomous vehicles.
1 code implementation • 9 Oct 2023 • Longxiang He, Li Shen, Linrui Zhang, Junbo Tan, Xueqian Wang
Constrained policy search (CPS) is a fundamental problem in offline reinforcement learning, which is generally solved by advantage weighted regression (AWR).
1 code implementation • 20 Sep 2023 • Haoyu Wang, Guozheng Ma, Cong Yu, Ning Gui, Linrui Zhang, Zhiqi Huang, Suwei Ma, Yongzhe Chang, Sen Zhang, Li Shen, Xueqian Wang, Peilin Zhao, DaCheng Tao
Notably, we are surprised to discover that robustness tends to decrease as fine-tuning (SFT and RLHF) is conducted.
no code implementations • 28 Jan 2023 • Qin Zhang, Linrui Zhang, Haoran Xu, Li Shen, Bowen Wang, Yongzhe Chang, Xueqian Wang, Bo Yuan, DaCheng Tao
Offline safe RL is of great practical relevance for deploying agents in real-world applications.
no code implementations • 14 Dec 2022 • Linrui Zhang, Zichen Yan, Li Shen, Shoujie Li, Xueqian Wang, DaCheng Tao
On the other hand, the safe agent mimics the baseline agent for policy improvement and learns to fulfill safety constraints via off-policy RL tuning.
no code implementations • 12 Dec 2022 • Linrui Zhang, Qin Zhang, Li Shen, Bo Yuan, Xueqian Wang, DaCheng Tao
Despite a large number of reinforcement learning (RL) methods focusing on safety-critical tasks, there is still a lack of high-quality evaluation of those algorithms that adheres to safety constraints at each decision step under complex and unknown dynamics.
3 code implementations • 15 Sep 2022 • Long Yang, Jiaming Ji, Juntao Dai, Linrui Zhang, Binbin Zhou, Pengfei Li, Yaodong Yang, Gang Pan
Compared to previous safe RL methods, CUP enjoys the benefits of 1) CUP generalizes the surrogate functions to generalized advantage estimator (GAE), leading to strong empirical performance.
1 code implementation • 17 Jun 2022 • Linrui Zhang, Qin Zhang, Li Shen, Bo Yuan, Xueqian Wang
Safe reinforcement learning (RL) has achieved significant success on risk-sensitive tasks and shown promise in autonomous driving (AD) as well.
no code implementations • 24 May 2022 • Linrui Zhang, Li Shen, Long Yang, Shixiang Chen, Bo Yuan, Xueqian Wang, DaCheng Tao
Safe reinforcement learning aims to learn the optimal policy while satisfying safety constraints, which is essential in real-world applications.
no code implementations • LREC 2020 • Linrui Zhang, Hsin-Lun Huang, Yang Yu, Dan Moldovan
As opposed to the traditional machine learning models which require considerable effort in designing task specific features, our model can be well adapted to the proposed tasks with a very limited amount of fine-tuning, which significantly reduces the manual effort in feature engineering.
no code implementations • COLING 2018 • Linrui Zhang, Dan Moldovan
This paper presents a neural net approach to determine Semantic Textual Similarity (STS) using attention-based bidirectional Long Short-Term Memory Networks (Bi-LSTM).