Search Results for author: Jie Bian

Found 1 papers, 0 papers with code

Maillard Sampling: Boltzmann Exploration Done Optimally

no code implementations5 Nov 2021 Jie Bian, Kwang-Sung Jun

This less-known algorithm, which we call Maillard sampling (MS), computes the probability of choosing each arm in a \textit{closed form}, which is not true for Thompson sampling, a widely-adopted bandit algorithm in the industry.

counterfactual Thompson Sampling

Cannot find the paper you are looking for? You can Submit a new open access paper.