no code implementations • ICLR 2018 • Hangyu Mao, Zhibo Gong, Zhen Xiao
In this paper, we study the reward design problem in cooperative MARL based on packet routing environments.
Multi-agent Reinforcement Learning reinforcement-learning +1
no code implementations • 3 Dec 2019 • Hangyu Mao, Zhengchao Zhang, Zhen Xiao, Zhibo Gong, Yan Ni
We evaluate the gating mechanism on several tasks.
no code implementations • 26 Feb 2019 • Hangyu Mao, Zhibo Gong, Zhengchao Zhang, Zhen Xiao, Yan Ni
Communication is an important factor for the big multi-agent world to stay organized and productive.
no code implementations • 13 Nov 2018 • Hangyu Mao, Zhengchao Zhang, Zhen Xiao, Zhibo Gong
Second, to model the teammates' policies using the collected information in an effective way, ATT-MADDPG enhances the centralized critic with an attention mechanism.
no code implementations • 10 Jun 2017 • Hangyu Mao, Zhibo Gong, Yan Ni, Zhen Xiao
Communication is a critical factor for the big multi-agent world to stay organized and productive.
Multi-agent Reinforcement Learning reinforcement-learning +1