Search Results for author: Yuval Lewi

Thompson Sampling for Adversarial Bit Prediction

We also bound the regret of those sequences, the worse case sequences have regret $O(\sqrt{T})$ and the best case sequence have regret $O(1)$.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.