no code implementations • 17 Mar 2024 • Zixian Yang, Lei Ying
We prove that our proposed algorithm yields a sublinear regret $\tilde{O}(T^{5/6})$ and queue-length bound $\tilde{O}(T^{2/3})$, where $T$ is the time horizon.
no code implementations • 5 Feb 2023 • Xin Liu, Zixian Yang, Lei Ying
This subroutine also achieves the state-of-the-art regret and constraint violation bounds for constrained online convex optimization problems, which is of independent interest.
no code implementations • 2 Sep 2022 • Zixian Yang, R. Srikant, Lei Ying
We prove that under our algorithm the asymptotic average queue length is bounded by one divided by the traffic slackness, which is order-wise optimal.
no code implementations • 26 May 2022 • Zixian Yang, Xin Liu, Lei Ying
To understand the exploration, exploitation, and engagement in these systems, we propose a new model, called MAB-A where "A" stands for abandonment and the abandonment probability depends on the current recommended item and the user's past experience (called state).