no code implementations • 16 Oct 2023 • Marc Jourdan, Clémence Réda
When APGAI is combined with a stopping rule, we prove upper bounds on the expected sampling complexity that hold at any confidence level.
no code implementations • 3 Oct 2022 • Marc Jourdan, Rémy Degenne, Emilie Kaufmann
The problem of identifying the best arm among a collection of items with Gaussian reward distributions is well understood when the variances are known.
no code implementations • 13 Jun 2022 • Marc Jourdan, Rémy Degenne, Dorian Baudry, Rianne de Heide, Emilie Kaufmann
Top Two algorithms arose as an adaptation of Thompson sampling to best arm identification in multi-armed bandit models (Russo, 2016), for parametric families of arms.
no code implementations • 9 Jun 2022 • Marc Jourdan, Rémy Degenne
In pure-exploration problems, information is gathered sequentially to answer a question on the stochastic environment.
no code implementations • 21 Jan 2021 • Marc Jourdan, Mojmír Mutný, Johannes Kirschner, Andreas Krause
Combinatorial bandits with semi-bandit feedback generalize multi-armed bandits: the agent chooses a set of arms and observes a noisy reward for each arm in the chosen set.
1 code implementation • 7 Nov 2018 • Marc Jourdan, Sebastien Blandin, Laura Wynter, Pralhad Deshpande
The Bitcoin transaction graph is a public data structure organized as transactions between addresses, each associated with a logical entity.
1 code implementation • 29 Oct 2018 • Marc Jourdan, Sebastien Blandin, Laura Wynter, Pralhad Deshpande
Bitcoin has created a new exchange paradigm within which financial transactions can be trusted without an intermediary.