no code implementations • 13 Apr 2021 • TaeYoung Kim, Luiz Felipe Vecchietti, Kyujin Choi, Sanem Sariel, Dongsoo Har
Because these two training processes are conducted in series at every timestep, agents can learn to maximize role rewards and team rewards simultaneously.
Multi-agent Reinforcement Learning • reinforcement-learning