Search Results for author: Wentao Weng

Found 3 papers, 1 papers with code

Learning to Defer in Content Moderation: The Human-AI Interplay

no code implementations • 19 Feb 2024 • Thodoris Lykouris, Wentao Weng

The classical learning-theoretic way to capture this human-AI interplay is via the framework of learning to defer, where the algorithm has the option to defer a classification task to humans for a fixed cost and immediately receive feedback.

Scheduling

Paper
Add Code

Efficient decentralized multi-agent learning in asymmetric bipartite queueing systems

no code implementations • 5 Jun 2022 • Daniel Freund, Thodoris Lykouris, Wentao Weng

We study decentralized multi-agent learning in bipartite queueing systems, a standard model for service systems.

Paper
Add Code

The Mean-Squared Error of Double Q-Learning

1 code implementation • NeurIPS 2020 • Wentao Weng, Harsh Gupta, Niao He, Lei Ying, R. Srikant

In this paper, we establish a theoretical comparison between the asymptotic mean-squared error of Double Q-learning and Q-learning.

Q-Learning

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.