no code implementations • 23 May 2019 • Thodoris Lykouris, Eva Tardos, Drishti Wali
We study the stochastic multi-armed bandit problem with the graph-based feedback structure introduced by Mannor and Shamir.
Thompson Sampling