no code implementations • 7 Mar 2019 • Yujie Chen, Yu Qian, Yichen Yao, Zili Wu, Rongqi Li, Yinzhi Zhou, Haoyuan Hu, Yinghui Xu
In this paper, we study a courier dispatching problem (CDP) arising from Alibaba's online pickup-service platform.
Multi-agent Reinforcement Learning • Reinforcement Learning