no code implementations • ECCV 2020 • Congzhentao Huang, Shuai Jiang, Yang Li, Ziyue Zhang, Jason Traish, Chen Deng, Sam Ferguson, Richard Yi Da Xu
To address this phenomenon, we propose a novel end-to-end training scheme that brings the three separate modules into a single model.
no code implementations • 19 Sep 2021 • Chapman Siu, Jason Traish, Richard Yi Da Xu
We propose using regularization for Multi-Agent Reinforcement Learning rather than learning explicit cooperative structures called {\em Multi-Agent Regularized Q-learning} (MARQ).
no code implementations • 19 Sep 2021 • Chapman Siu, Jason Traish, Richard Yi Da Xu
We demonstrate the flexibility of this approach and how it can be adapted to online contexts where the environment is available to collect experiences and a variety of other contexts.
no code implementations • 19 Sep 2021 • Chapman Siu, Jason Traish, Richard Yi Da Xu
This paper introduces Greedy UnMix (GUM) for cooperative multi-agent reinforcement learning (MARL).