no code implementations • NeurIPS 2021 • Hassan Saber, Pierre Ménard, Odalric-Ambrym Maillard
We consider a multi-armed bandit problem specified by a set of one-dimensional family exponential distributions endowed with a unimodal structure.
1 code implementation • NeurIPS 2021 • Fabien Pesquerel, Hassan Saber, Odalric-Ambrym Maillard
For this structured problem of practical relevance, we first derive the asymptotic regret lower bound and corresponding constrained optimization problem.
no code implementations • 7 Jul 2020 • Hassan Saber, Pierre Ménard, Odalric-Ambrym Maillard
[0, 1]^{\mathcal{A}\times\mathcal{B}}$ and by a given weight matrix $\omega\!=\!
no code implementations • 30 Jun 2020 • Hassan Saber, Pierre Ménard, Odalric-Ambrym Maillard
This strategy is proven optimal.