Search Results for author: Wentao Weng

Found 3 papers, 1 papers with code

Learning to Defer in Content Moderation: The Human-AI Interplay

no code implementations19 Feb 2024 Thodoris Lykouris, Wentao Weng

The classical learning-theoretic way to capture this human-AI interplay is via the framework of learning to defer, where the algorithm has the option to defer a classification task to humans for a fixed cost and immediately receive feedback.

Scheduling

Efficient decentralized multi-agent learning in asymmetric bipartite queueing systems

no code implementations5 Jun 2022 Daniel Freund, Thodoris Lykouris, Wentao Weng

We study decentralized multi-agent learning in bipartite queueing systems, a standard model for service systems.

The Mean-Squared Error of Double Q-Learning

1 code implementation NeurIPS 2020 Wentao Weng, Harsh Gupta, Niao He, Lei Ying, R. Srikant

In this paper, we establish a theoretical comparison between the asymptotic mean-squared error of Double Q-learning and Q-learning.

Q-Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.