no code implementations • 23 Feb 2024 • Julien Zhou, Pierre Gaillard, Thibaud Rahier, Houssam Zenati, Julyan Arbel
We address the problem of stochastic combinatorial semi-bandits, where a player can select from P subsets of a set containing d base items.