no code implementations • 11 Feb 2023 • Xin Liu, Yaran Chen, Haoran Li, Boyu Li, Dongbin Zhao
Moreover, prototypical representation learning with a novel intrinsic loss is proposed to pre-train an effective and generic encoder across different domains.
1 code implementation • 2 Feb 2023 • Yupeng Zheng, Chengliang Zhong, Pengfei Li, Huan-ang Gao, Yuhang Zheng, Bu Jin, Ling Wang, Hao Zhao, Guyue Zhou, Qichao Zhang, Dongbin Zhao
By fitting a bridge-shaped curve to the illumination map distribution, both regions are suppressed and two tasks are bridged naturally.
no code implementations • 5 Dec 2022 • Jiajun Chai, Wenzhang Chen, Yuanheng Zhu, Zong-xin Yao, Dongbin Zhao
Then the inner loop tracks the macro behavior with a flight controller by calculating the actual input signals for the aircraft.
no code implementations • 23 Nov 2022 • Junjie Wang, Yao Mu, Dong Li, Qichao Zhang, Dongbin Zhao, Yuzheng Zhuang, Ping Luo, Bin Wang, Jianye Hao
The latent world model provides a promising way to learn policies in a compact latent space for tasks with high-dimensional observations, however, its generalization across diverse environments with unseen dynamics remains challenging.
Model-based Reinforcement Learning
reinforcement-learning
+1
no code implementations • 19 Oct 2022 • Ding Li, Qichao Zhang, Shuai Lu, Yifeng Pan, Dongbin Zhao
Our CGTP framework is an end to end and interpretable model, including three main stages: context encoding, goal interactive prediction and trajectory interactive prediction.
no code implementations • 31 Mar 2022 • Qichao Zhang, Yinfeng Gao, Yikang Zhang, Youtian Guo, Dawei Ding, Yunpeng Wang, Peng Sun, Dongbin Zhao
In particular, TrajGen consists of the multi-modal trajectory prediction stage and the reinforcement learning based trajectory modification stage.
no code implementations • 19 Feb 2022 • Yuqi Liu, Qichao Zhang, Dongbin Zhao
In this paper, we formulate a multi-task safe reinforcement learning with social attention to improve the safety and efficiency when interacting with other traffic participants.
no code implementations • 13 Feb 2022 • Nannan Li, Yaran Chen, Weifan Li, Zixiang Ding, Dongbin Zhao
In this paper, we propose the broad attention to improve the performance by incorporating the attention relationship of different layers for vision transformer, which is called BViT.
no code implementations • 15 Nov 2021 • Zixiang Ding, Yaran Chen, Nannan Li, Dongbin Zhao, C. L. Philip Chen
Moreover, multi-scale feature fusion and knowledge embedding are proposed to improve the performance of BCNN with shallow topology.
no code implementations • 22 Sep 2021 • Junjie Wang, Qichao Zhang, Dongbin Zhao
We train several state-of-the-art deep reinforcement learning methods in the designed training scenarios and provide the benchmark metrics evaluation results of the trained models in the test scenarios.
1 code implementation • 22 Sep 2021 • Yuqi Liu, Qichao Zhang, Dongbin Zhao
The test benchmark and baselines are to provide a fair and comprehensive training and testing platform for the study of RL for autonomous driving in the intersection scenario, advancing the progress of RL-based methods for intersection autonomous driving control.
no code implementations • 10 Oct 2020 • Guangzheng Hu, Yuanheng Zhu, Dongbin Zhao, Mengchen Zhao, Jianye Hao
Then the design of the event-triggered strategy is formulated as a constrained Markov decision problem, and reinforcement learning finds the best communication protocol that satisfies the limited bandwidth constraint.
Multiagent Systems
no code implementations • 22 Sep 2020 • Nannan Li, Yu Pan, Yaran Chen, Zixiang Ding, Dongbin Zhao, Zenglin Xu
Interestingly, we discover that part of the rank elements is sensitive and usually aggregate in a narrow region, namely an interest region.
no code implementations • 21 Sep 2020 • Jun-Jie Wang, Qichao Zhang, Dongbin Zhao, Mengchen Zhao, Jianye Hao
Existing model-based value expansion methods typically leverage a world model for value estimation with a fixed rollout horizon to assist policy learning.
Model-based Reinforcement Learning
reinforcement-learning
+1
no code implementations • 18 Sep 2020 • Zixiang Ding, Yaran Chen, Nannan Li, Dongbin Zhao
For this consequent issue, two solutions are given: 1) we propose Confident Learning Rate (CLR) that considers the confidence of gradient for architecture weights update, increasing with the training time of over-parameterized BCNN; 2) we introduce the combination of partial channel connections and edge normalization that also can improve the memory efficiency further.
no code implementations • 15 Jul 2020 • Junwen Chen, Yi Lu, Yaran Chen, Dongbin Zhao, Zhonghua Pang
A good object segmentation should contain clear contours and complete regions.
no code implementations • 18 Apr 2020 • Haoran Li, Yaran Chen, Qichao Zhang, Dongbin Zhao
Considering the bird's eye views(BEV) of the LiDAR remains the space structure in horizontal plane, this paper proposes a bidirectional fusion network(BiFNet) to fuse the image and BEV of the point cloud.
no code implementations • 10 Apr 2020 • Yaran Chen, Ruiyuan Gao, Fenggang Liu, Dongbin Zhao
Unlike previous search algorithms, and benefiting from inherited knowledge, our method is able to directly search for architectures in the macro space by NSGA-II algorithm without tuning parameters in these \textit{module}s. Experiments show that our strategy can efficiently evaluate the performance of new architecture even without tuning weights in convolutional layers.
no code implementations • 31 Mar 2020 • Zhentao Tang, Yuanheng Zhu, Dongbin Zhao, Simon M. Lucas
In contrast to conventional RHEA, an opponent model is proposed and is optimized by supervised learning with cross-entropy and reinforcement learning with policy gradient and Q-learning respectively, based on history observations from opponent.
no code implementations • 18 Jan 2020 • Zixiang Ding, Yaran Chen, Nannan Li, Dongbin Zhao, Zhiquan Sun, C. L. Philip Chen
In this paper, we propose Broad Neural Architecture Search (BNAS) where we elaborately design broad scalable architecture dubbed Broad Convolutional Neural Network (BCNN) to solve the above issue.
no code implementations • 2 Jan 2020 • Yi Lu, Yaran Chen, Dongbin Zhao, Jianxin Chen
Then we apply graph convolutional network to solve this graph node classification problem.
no code implementations • 23 Dec 2019 • Kun Shao, Zhentao Tang, Yuanheng Zhu, Nannan Li, Dongbin Zhao
In this paper, we survey the progress of DRL methods, including value-based, policy gradient, and model-based algorithms, and compare their main techniques and properties.
no code implementations • 11 May 2019 • Dong Li, Qichao Zhang, Dongbin Zhao, Yuzheng Zhuang, Bin Wang, Wulong Liu, Rasul Tutunov, Jun Wang
To address the long-term memory issue, this paper proposes a graph attention memory (GAM) architecture consisting of memory construction module, graph attention module and control module.
no code implementations • 30 Mar 2019 • Jun-Jie Wang, Qichao Zhang, Dongbin Zhao, Yaran Chen
Autonomous driving decision-making is a great challenge due to the complexity and uncertainty of the traffic environment.
1 code implementation • 30 Oct 2018 • Dong Li, Dongbin Zhao, Qichao Zhang, Yaran Chen
The control module which is based on reinforcement learning then makes a control decision based on these features.
1 code implementation • 3 Apr 2018 • Kun Shao, Yuanheng Zhu, Dongbin Zhao
With reinforcement learning and curriculum transfer learning, our units are able to learn appropriate strategies in StarCraft micromanagement scenarios.