no code implementations • 17 Sep 2022 • Yunbo Qiu, Yuzhu Zhan, Yue Jin, Jian Wang, Xudong Zhang
By pretraining with non-expert demonstrations, PwD-MARL improves sample efficiency in the process of online MARL with a warm start.
Multi-agent Reinforcement Learning reinforcement-learning +1