1 code implementation • 20 Dec 2024 • Guangchong Zhou, Zeren Zhang, Guoliang Fan
Exploration in cooperative multi-agent reinforcement learning (MARL) remains challenging for value-based agents due to the absence of an explicit policy.
no code implementations • 18 Aug 2024 • Zhiwei Xu, Hangyu Mao, Nianmin Zhang, Xin Xin, Pengjie Ren, Dapeng Li, Bin Zhang, Guoliang Fan, Zhumin Chen, Changwei Wang, Jiangjin Yin
In partially observable multi-agent systems, agents typically only have access to local observations.
no code implementations • 27 Apr 2024 • Dapeng Li, Hang Dong, Lu Wang, Bo Qiao, Si Qin, QIngwei Lin, Dongmei Zhang, Qi Zhang, Zhiwei Xu, Bin Zhang, Guoliang Fan
The entire framework has a message module and an action module.
Multi-agent Reinforcement Learning
reinforcement-learning
+1
no code implementations • 14 Dec 2023 • Dapeng Li, Na Lou, Bin Zhang, Zhiwei Xu, Guoliang Fan
Parameter sharing, as an important technique in multi-agent systems, can effectively solve the scalability issue in large-scale agent problems.
no code implementations • 7 Dec 2023 • Guangchong Zhou, Zhiwei Xu, Zeren Zhang, Guoliang Fan
The coordination between agents in multi-agent systems has become a popular topic in many fields.
no code implementations • 23 Nov 2023 • Bin Zhang, Hangyu Mao, Jingqing Ruan, Ying Wen, Yang Li, Shao Zhang, Zhiwei Xu, Dapeng Li, Ziyue Li, Rui Zhao, Lijuan Li, Guoliang Fan
The remarkable progress in Large Language Models (LLMs) opens up new avenues for addressing planning and decision-making problems in Multi-Agent Systems (MAS).
no code implementations • 13 May 2023 • Bin Zhang, Hangyu Mao, Lijuan Li, Zhiwei Xu, Dapeng Li, Rui Zhao, Guoliang Fan
Our research contributes to the development of an effective and adaptable asynchronous action coordination method that can be widely applied to various task types and environmental configurations in MAS.
no code implementations • 28 Apr 2023 • Dapeng Li, Zhiwei Xu, Bin Zhang, Guoliang Fan
Centralized training with decentralized execution (CTDE) is a widely-used learning paradigm that has achieved significant success in complex tasks.
no code implementations • 25 Apr 2023 • Dapeng Li, Zhiwei Xu, Bin Zhang, Guoliang Fan
In addition, our structure can be applied to various existing mainstream reinforcement learning algorithms with minor modifications and can deal with the problem with a variable number of agents.
no code implementations • 20 Apr 2023 • Bin Zhang, Lijuan Li, Zhiwei Xu, Dapeng Li, Guoliang Fan
In multi-agent reinforcement learning (MARL), self-interested agents attempt to establish equilibrium and achieve coordination depending on game structure.
no code implementations • 21 Mar 2023 • Dapeng Li, Feiyang Pan, Jia He, Zhiwei Xu, Dandan Tu, Guoliang Fan
In high-dimensional time-series analysis, it is essential to have a set of key factors (namely, the style factors) that explain the change of the observed variable.
no code implementations • 4 Feb 2023 • Zhiwei Xu, Bin Zhang, Dapeng Li, Guangchong Zhou, Zeren Zhang, Guoliang Fan
Value decomposition methods have gained popularity in the field of cooperative multi-agent reinforcement learning.
no code implementations • 6 Jun 2022 • Zhiwei Xu, Bin Zhang, Dapeng Li, Zeren Zhang, Guangchong Zhou, Hao Chen, Guoliang Fan
Almost all multi-agent reinforcement learning algorithms without communication follow the principle of centralized training with decentralized execution.
no code implementations • 20 Apr 2022 • Zhiwei Xu, Dapeng Li, Bin Zhang, Yuan Zhan, Yunpeng Bai, Guoliang Fan
Recently, model-based agents have achieved better performance than model-free ones using the same computational budget and training time in single-agent environments.
Multi-agent Reinforcement Learning
reinforcement-learning
+1
no code implementations • 7 Mar 2022 • Bin Zhang, Yunpeng Bai, Zhiwei Xu, Dapeng Li, Guoliang Fan
The application of deep reinforcement learning in multi-agent systems introduces extra challenges.
1 code implementation • 9 Dec 2021 • Yunpeng Bai, Chen Gong, Bin Zhang, Guoliang Fan, Xinwen Hou, Yu Liu
HGCN-MIX models agents as well as their relationships as a hypergraph, where agents are nodes and hyperedges among nodes indicate that the corresponding agents can coordinate to achieve larger rewards.
no code implementations • 14 Oct 2021 • Zhiwei Xu, Yunpeng Bai, Bin Zhang, Dapeng Li, Guoliang Fan
Recently, some challenging tasks in multi-agent systems have been solved by some hierarchical reinforcement learning methods.
Hierarchical Reinforcement Learning
Multi-agent Reinforcement Learning
+5
no code implementations • 24 Sep 2021 • Chen Gong, Qiang He, Yunpeng Bai, Zhou Yang, Xiaoyu Chen, Xinwen Hou, Xianjie Zhang, Yu Liu, Guoliang Fan
In FRL, the policy evaluation and policy improvement phases are simultaneously performed by minimizing the $f$-divergence between the learning policy and sampling policy, which is distinct from conventional DRL algorithms that aim to maximize the expected cumulative rewards.
no code implementations • 22 Jun 2021 • Zhiwei Xu, Dapeng Li, Yunpeng Bai, Guoliang Fan
In the real world, many tasks require multiple agents to cooperate with each other under the condition of local observations.
Distributional Reinforcement Learning
reinforcement-learning
+4
no code implementations • 13 May 2021 • Zhiwei Xu, Yunpeng Bai, Dapeng Li, Bin Zhang, Guoliang Fan
As one of the solutions to the decentralized partially observable Markov decision process (Dec-POMDP) problems, the value decomposition method has achieved significant results recently.
Multi-agent Reinforcement Learning
reinforcement-learning
+3
no code implementations • The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2019 2019 • Duohan Liang, Guoliang Fan, Guangfeng Lin, Wanjun Chen, Xiaorong Pan, Hong Zhu
In this paper, we propose a three-stream convolutional neural network (3SCNN) for action recognition from skeleton sequences, which aims to thoroughly and fully exploit the skeleton data by extracting, learning, fusing and inferring multiple motion-related features, including 3D joint positions and joint displacements across adjacent frames as well as oriented bone segments.
Ranked #66 on
Skeleton Based Action Recognition
on NTU RGB+D
no code implementations • 5 Mar 2018 • Meng Ding, Guoliang Fan
We present a novel parametric 3D shape representation, Generalized sum of Gaussians (G-SoG), which is particularly suitable for pose estimation of articulated objects.