Search Results for author: Xu Kuang

Found 5 papers, 0 papers with code

Non-Stationary Contextual Bandit Learning via Neural Predictive Ensemble Sampling

no code implementations • 11 Oct 2023 • Zheqing Zhu, Yueyang Liu, Xu Kuang, Benjamin Van Roy

Real-world applications of contextual bandits often exhibit non-stationarity due to seasonality, serendipity, and evolving social trends.

Multi-Armed Bandits

Paper
Add Code

A Definition of Non-Stationary Bandits

no code implementations • 23 Feb 2023 • Yueyang Liu, Xu Kuang, Benjamin Van Roy

Despite the subject of non-stationary bandit learning having attracted much recent attention, we have yet to identify a formal definition of non-stationarity that can consistently distinguish non-stationary bandits from stationary ones.

Paper
Add Code

Experimenting under Stochastic Congestion

no code implementations • 22 Feb 2023 • Shuangning Li, Ramesh Johari, Xu Kuang, Stefan Wager

We study randomized experiments in a service system when stochastic congestion can arise from temporarily limited supply and/or demand.

Experimental Design

Paper
Add Code

Non-Stationary Bandit Learning via Predictive Sampling

no code implementations • 4 May 2022 • Yueyang Liu, Xu Kuang, Benjamin Van Roy

We attribute such failures to the fact that, when exploring, the algorithm does not differentiate actions based on how quickly the information acquired loses its usefulness due to non-stationarity.

Attribute Thompson Sampling

Paper
Add Code

Weak Signal Asymptotics for Sequentially Randomized Experiments

no code implementations • 25 Jan 2021 • Xu Kuang, Stefan Wager

We use the lens of weak signal asymptotics to study a class of sequentially randomized experiments, including those that arise in solving multi-armed bandit problems.

Thompson Sampling

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.