no code implementations • 11 Oct 2023 • Zheqing Zhu, Yueyang Liu, Xu Kuang, Benjamin Van Roy
Real-world applications of contextual bandits often exhibit non-stationarity due to seasonality, serendipity, and evolving social trends.
no code implementations • 23 Feb 2023 • Yueyang Liu, Xu Kuang, Benjamin Van Roy
Despite the subject of non-stationary bandit learning having attracted much recent attention, we have yet to identify a formal definition of non-stationarity that can consistently distinguish non-stationary bandits from stationary ones.
no code implementations • 22 Feb 2023 • Shuangning Li, Ramesh Johari, Xu Kuang, Stefan Wager
We study randomized experiments in a service system when stochastic congestion can arise from temporarily limited supply and/or demand.
no code implementations • 4 May 2022 • Yueyang Liu, Xu Kuang, Benjamin Van Roy
We attribute such failures to the fact that, when exploring, the algorithm does not differentiate actions based on how quickly the information acquired loses its usefulness due to non-stationarity.
no code implementations • 25 Jan 2021 • Xu Kuang, Stefan Wager
We use the lens of weak signal asymptotics to study a class of sequentially randomized experiments, including those that arise in solving multi-armed bandit problems.