1 code implementation • 21 Feb 2024 • Edwin Zhang, Sadie Zhao, Tonghan Wang, Safwan Hossain, Henry Gasztowtt, Stephan Zheng, David C. Parkes, Milind Tambe, YiLing Chen
Artificial Intelligence (AI) holds promise as a technology that can be used to improve government and economic policy-making.
no code implementations • 7 Feb 2024 • Safwan Hossain, Tonghan Wang, Tao Lin, YiLing Chen, David C. Parkes, Haifeng Xu
The core solution concept here is the Nash equilibrium of senders' signaling policies.
no code implementations • 19 Aug 2023 • Chenghao Li, Tonghan Wang, Chongjie Zhang, Qianchuan Zhao
In the realm of multi-agent reinforcement learning, intrinsic motivations have emerged as a pivotal tool for exploration.
Multi-agent Reinforcement Learning reinforcement-learning +2
1 code implementation • 31 May 2023 • Heng Dong, Junyu Zhang, Tonghan Wang, Chongjie Zhang
Robot design aims at learning to create robots that can be easily controlled and perform tasks efficiently.
1 code implementation • 26 Oct 2022 • Heng Dong, Tonghan Wang, Jiayuan Liu, Chongjie Zhang
Modular Reinforcement Learning (RL) decentralizes the control of multi-joint robots by learning policies for each actuator.
no code implementations • 26 Oct 2022 • Yipeng Kang, Tonghan Wang, Xiaoran Wu, Qianlan Yang, Chongjie Zhang
Value decomposition multi-agent reinforcement learning methods learn the global value function as a mixing of each agent's individual utility functions.
no code implementations • 9 Mar 2022 • Rongjun Qin, Feng Chen, Tonghan Wang, Lei Yuan, Xiaoran Wu, Zongzhang Zhang, Chongjie Zhang, Yang Yu
We demonstrate that the task representation can capture the relationship among tasks, and can generalize to unseen tasks.
1 code implementation • 7 Dec 2021 • Qianlan Yang, Weijun Dong, Zhizhou Ren, Jianhao Wang, Tonghan Wang, Chongjie Zhang
However, one critical challenge in this paradigm is the complexity of greedy action selection with respect to the factorized values.
no code implementations • 15 Oct 2021 • Siyang Wu, Tonghan Wang, Chenghao Li, Yang Hu, Chongjie Zhang
Multi-agent reinforcement learning tasks put a high demand on the volume of training samples.
no code implementations • 29 Sep 2021 • Heng Dong, Tonghan Wang, Jiayuan Liu, Chi Han, Chongjie Zhang
Promoting cooperation among self-interested agents is a long-standing and interdisciplinary problem, but receives less attention in multi-agent reinforcement learning (MARL).
1 code implementation • ICLR 2022 • Tonghan Wang, Liang Zeng, Weijun Dong, Qianlan Yang, Yang Yu, Chongjie Zhang
Learning sparse coordination graphs adaptive to the coordination dynamics among agents is a long-standing problem in cooperative multi-agent learning.
2 code implementations • NeurIPS 2021 • Chenghao Li, Tonghan Wang, Chengjie WU, Qianchuan Zhao, Jun Yang, Chongjie Zhang
Recently, deep multi-agent reinforcement learning (MARL) has shown the promise to solve complex cooperative tasks.
Multi-agent Reinforcement Learning reinforcement-learning +3
no code implementations • 23 Apr 2021 • Heng Dong, Tonghan Wang, Jiayuan Liu, Chi Han, Chongjie Zhang
We propose a novel learning framework to encourage homophilic incentives and show that it achieves stable cooperation in both SSDs of public goods and tragedy of the commons.
no code implementations • ICLR 2021 • Yihan Wang, Beining Han, Tonghan Wang, Heng Dong, Chongjie Zhang
In this paper, we investigate causes that hinder the performance of MAPG algorithms and present a multi-agent decomposed policy gradient method (DOP).
2 code implementations • ICLR 2021 • Tonghan Wang, Tarun Gupta, Anuj Mahajan, Bei Peng, Shimon Whiteson, Chongjie Zhang
Learning a role selector based on action effects makes role discovery much easier because it forms a bi-level learning hierarchy -- the role selector searches in a smaller role space and at a lower temporal resolution, while role policies learn in significantly reduced primitive action-observation spaces.
1 code implementation • 24 Jul 2020 • Yihan Wang, Beining Han, Tonghan Wang, Heng Dong, Chongjie Zhang
In this paper, we investigate causes that hinder the performance of MAPG algorithms and present a multi-agent decomposed policy gradient method (DOP).
no code implementations • NeurIPS 2020 • Yipeng Kang, Tonghan Wang, Gerard de Melo
Emergentism and pragmatics are two research fields that study the dynamics of linguistic communication along substantially different timescales and intelligence levels.
Multi-agent Reinforcement Learning Reinforcement Learning (RL) +2
1 code implementation • ICML 2020 • Tonghan Wang, Heng Dong, Victor Lesser, Chongjie Zhang
In this paper, we synergize these two paradigms and propose a role-oriented MARL framework (ROMA).
Multiagent Systems
1 code implementation • ICLR 2020 • Tonghan Wang, Jianhao Wang, Yi Wu, Chongjie Zhang
We present two exploration methods: exploration via information-theoretic influence (EITI) and exploration via decision-theoretic influence (EDTI), by exploiting the role of interaction in coordinated behaviors of agents.
1 code implementation • ICLR 2020 • Tonghan Wang, Jianhao Wang, Chongyi Zheng, Chongjie Zhang
Recently, value function factorization learning emerges as a promising way to address these challenges in collaborative multi-agent systems.
no code implementations • 7 Mar 2019 • Xinliang Song, Tonghan Wang, Chongjie Zhang
Learning in a multi-agent system is challenging because agents are simultaneously learning and the environment is not stationary, undermining convergence guarantees.
no code implementations • 22 Aug 2017 • Huizhen Jia, Lu Zhang, Tonghan Wang
Contrast is an inherent visual attribute that indicates image quality, and visual saliency (VS) is a quality that attracts the attention of human beings.