no code implementations • 5 Jun 2022 • Daniel Freund, Thodoris Lykouris, Wentao Weng
We study decentralized multi-agent learning in bipartite queueing systems, a standard model for service systems.
1 code implementation • NeurIPS 2020 • Wentao Weng, Harsh Gupta, Niao He, Lei Ying, R. Srikant
In this paper, we establish a theoretical comparison between the asymptotic mean-squared error of Double Q-learning and Q-learning.