no code implementations • 12 Jul 2023 • Kai Cui, Sascha Hauck, Christian Fabian, Heinz Koeppl
However, multi-agent RL (MARL) remains a challenge in terms of decentralization, partial observability and scalability to many agents.
Policy Gradient Methods Reinforcement Learning (RL)