Search Results for author: Drishti Wali

Found 1 papers, 0 papers with code

Feedback graph regret bounds for Thompson Sampling and UCB

no code implementations23 May 2019 Thodoris Lykouris, Eva Tardos, Drishti Wali

We study the stochastic multi-armed bandit problem with the graph-based feedback structure introduced by Mannor and Shamir.

Thompson Sampling

Cannot find the paper you are looking for? You can Submit a new open access paper.