no code implementations • 7 Dec 2020 • Lipeng Wan, Xuwei Song, Xuguang Lan, Nanning Zheng
To solve the challenge, general methods for policy-based multi-agent reinforcement learning introduce differentiated value functions or advantage functions for individual agents.