no code implementations • 16 Mar 2023 • Mianchu Wang, Yue Jin, Giovanni Montana
Offline reinforcement learning (RL) aims to infer sequential decision policies using only offline datasets.
no code implementations • 18 Feb 2023 • Yunbo Qiu, Yue Jin, Lebin Yu, Jian Wang, Xudong Zhang
Multi-agent reinforcement learning (MARL) has achieved great progress in cooperative tasks in recent years.
Multi-agent Reinforcement Learning reinforcement-learning +1
no code implementations • 17 Sep 2022 • Yunbo Qiu, Yuzhu Zhan, Yue Jin, Jian Wang, Xudong Zhang
By pretraining with non-expert demonstrations, PwD-MARL improves sample efficiency in the process of online MARL with a warm start.
Multi-agent Reinforcement Learning reinforcement-learning +1
no code implementations • 17 Sep 2022 • Yunbo Qiu, Yue Jin, Jian Wang, Xudong Zhang
Flocking control is a challenging problem, where multiple agents, such as drones or vehicles, need to reach a target position while maintaining the flock and avoiding collisions with obstacles and collisions among agents in the environment.
Multi-agent Reinforcement Learning reinforcement-learning +2
no code implementations • 23 May 2022 • Yue Jin, Shuangqing Wei, Jian Yuan, Xudong Zhang
In this paper, we explore the spatiotemporal structure of agents' decisions and consider the hierarchy of coordination from the perspective of multilevel emergence dynamics, based on which a novel approach, Learning to Advise and Learning from Advice (LALA), is proposed to improve MARL.
Multi-agent Reinforcement Learning reinforcement-learning +1
no code implementations • 29 Sep 2021 • Yue Jin, Shuangqing Wei, Jian Yuan, Xudong Zhang
In multi-agent deep reinforcement learning, extracting sufficient and compact information of other agents is critical to attain efficient convergence and scalability of an algorithm.
Multi-agent Reinforcement Learning reinforcement-learning +2
no code implementations • 9 Jul 2021 • Yue Jin, Tianqing Zheng, Chao GAO, Guoqiang Xu
Analyzing human affect is vital for human-computer interaction systems.
1 code implementation • 3 Jul 2021 • Yue Jin, Yue Zhang, Tao Qin, Xudong Zhang, Jian Yuan, Houqiang Li, Tie-Yan Liu
Inspired by the two observations, in this work, we study a new problem, supervised off-policy ranking (SOPR), which aims to rank a set of target policies based on supervised learning by leveraging off-policy data and policies with known performance.
no code implementations • 11 Aug 2020 • Yongchao Liu, Yue Jin, Yong Chen, Teng Teng, Hang Ou, Rui Zhao, Yao Zhang
Accelerating deep model training and inference is crucial in practice.