no code implementations • 26 Oct 2022 • Yipeng Kang, Tonghan Wang, Xiaoran Wu, Qianlan Yang, Chongjie Zhang
Value decomposition multi-agent reinforcement learning methods learn the global value function as a mixing of each agent's individual utility functions.
no code implementations • NeurIPS 2020 • Yipeng Kang, Tonghan Wang, Gerard de Melo
Emergentism and pragmatics are two research fields that study the dynamics of linguistic communication along substantially different timescales and intelligence levels.
Multi-agent Reinforcement Learning Reinforcement Learning (RL) +2