Search Results for author: Zixian Yang

Found 4 papers, 0 papers with code

Learning-Based Pricing and Matching for Two-Sided Queues

no code implementations • 17 Mar 2024 • Zixian Yang, Lei Ying

We prove that our proposed algorithm yields a sublinear regret $\tilde{O}(T^{5/6})$ and queue-length bound $\tilde{O}(T^{2/3})$, where $T$ is the time horizon.

Paper
Add Code

Online Nonstochastic Control with Adversarial and Static Constraints

no code implementations • 5 Feb 2023 • Xin Liu, Zixian Yang, Lei Ying

This subroutine also achieves the state-of-the-art regret and constraint violation bounds for constrained online convex optimization problems, which is of independent interest.

Paper
Add Code

Learning While Scheduling in Multi-Server Systems with Unknown Statistics: MaxWeight with Discounted UCB

no code implementations • 2 Sep 2022 • Zixian Yang, R. Srikant, Lei Ying

We prove that under our algorithm the asymptotic average queue length is bounded by one divided by the traffic slackness, which is order-wise optimal.

Scheduling

Paper
Add Code

Exploration, Exploitation, and Engagement in Multi-Armed Bandits with Abandonment

no code implementations • 26 May 2022 • Zixian Yang, Xin Liu, Lei Ying

To understand the exploration, exploitation, and engagement in these systems, we propose a new model, called MAB-A where "A" stands for abandonment and the abandonment probability depends on the current recommended item and the user's past experience (called state).

Multi-Armed Bandits Q-Learning +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.