Search Results for author: Robin Allesiardo

Found 4 papers, 0 papers with code

Random Shuffling and Resets for the Non-stationary Stochastic Bandit Problem

no code implementations7 Sep 2016 Robin Allesiardo, Raphaël Féraud, Odalric-Ambrym Maillard

For the best-arm identification task, we introduce a version of Successive Elimination based on random shuffling of the $K$ arms.

Random Forest for the Contextual Bandit Problem - extended version

no code implementations27 Apr 2015 Raphaël Féraud, Robin Allesiardo, Tanguy Urvoy, Fabrice Clérot

The dependence of the sample complexity upon the number of contextual variables is logarithmic.

A Neural Networks Committee for the Contextual Bandit Problem

no code implementations29 Sep 2014 Robin Allesiardo, Raphael Feraud, Djallel Bouneffouf

This paper presents a new contextual bandit algorithm, NeuralBandit, which does not need hypothesis on stationarity of contexts and rewards.

Cannot find the paper you are looking for? You can Submit a new open access paper.