Search Results for author: Yanrong Kang

Found 1 papers, 0 papers with code

GuideBoot: Guided Bootstrap for Deep Contextual Bandits

no code implementations18 Jul 2021 Feiyang Pan, Haoming Li, Xiang Ao, Wei Wang, Yanrong Kang, Ao Tan, Qing He

The proposed method is efficient as it can make decisions on-the-fly by utilizing only one randomly chosen model, but is also effective as we show that it can be viewed as a non-Bayesian approximation of Thompson sampling.

Multi-Armed Bandits Thompson Sampling

Cannot find the paper you are looking for? You can Submit a new open access paper.