1 code implementation • NeurIPS 2020 • Wentao Weng, Harsh Gupta, Niao He, Lei Ying, R. Srikant
In this paper, we establish a theoretical comparison between the asymptotic mean-squared error of Double Q-learning and Q-learning.
no code implementations • 5 Jun 2022 • Daniel Freund, Thodoris Lykouris, Wentao Weng
We study decentralized multi-agent learning in bipartite queueing systems, a standard model for service systems.
no code implementations • 19 Feb 2024 • Thodoris Lykouris, Wentao Weng
The classical learning-theoretic way to capture human-AI interplay is the learning-to-defer framework, in which the algorithm may defer a classification task to humans for a fixed cost and immediately receive feedback.