Search Results for author: Xu Kuang

Found 5 papers, 0 papers with code

Non-Stationary Contextual Bandit Learning via Neural Predictive Ensemble Sampling

no code implementations11 Oct 2023 Zheqing Zhu, Yueyang Liu, Xu Kuang, Benjamin Van Roy

Real-world applications of contextual bandits often exhibit non-stationarity due to seasonality, serendipity, and evolving social trends.

Multi-Armed Bandits

A Definition of Non-Stationary Bandits

no code implementations23 Feb 2023 Yueyang Liu, Xu Kuang, Benjamin Van Roy

Despite the subject of non-stationary bandit learning having attracted much recent attention, we have yet to identify a formal definition of non-stationarity that can consistently distinguish non-stationary bandits from stationary ones.

Experimenting under Stochastic Congestion

no code implementations22 Feb 2023 Shuangning Li, Ramesh Johari, Xu Kuang, Stefan Wager

We study randomized experiments in a service system when stochastic congestion can arise from temporarily limited supply and/or demand.

Experimental Design

Non-Stationary Bandit Learning via Predictive Sampling

no code implementations4 May 2022 Yueyang Liu, Xu Kuang, Benjamin Van Roy

We attribute such failures to the fact that, when exploring, the algorithm does not differentiate actions based on how quickly the information acquired loses its usefulness due to non-stationarity.

Attribute Thompson Sampling

Weak Signal Asymptotics for Sequentially Randomized Experiments

no code implementations25 Jan 2021 Xu Kuang, Stefan Wager

We use the lens of weak signal asymptotics to study a class of sequentially randomized experiments, including those that arise in solving multi-armed bandit problems.

Thompson Sampling

Cannot find the paper you are looking for? You can Submit a new open access paper.