Search Results for author: Aymen Al Marjani

Found 3 papers, 0 papers with code

On the complexity of All $\varepsilon$-Best Arms Identification

no code implementations 13 Feb 2022 Aymen Al Marjani, Tomáš Kocák, Aurélien Garivier

Our method is based on a complete characterization of the alternative bandit instances that the optimal sampling strategy needs to rule out, thus making our bound tighter than the one provided by \cite{Mason2020}.

Navigating to the Best Policy in Markov Decision Processes

no code implementations NeurIPS 2021 Aymen Al Marjani, Aurélien Garivier, Alexandre Proutiere

We investigate the classical active pure exploration problem in Markov Decision Processes, where the agent sequentially selects actions and, from the resulting system trajectory, aims at identifying the best policy as fast as possible.

Adaptive Sampling for Best Policy Identification in Markov Decision Processes

no code implementations 28 Sep 2020 Aymen Al Marjani, Alexandre Proutiere

We then provide a simple and tight upper bound of the sample complexity lower bound, whose corresponding nearly-optimal sample allocation becomes explicit.
