Search Results for author: Raphael Feraud

Found 4 papers, 0 papers with code

Batched Bandits with Crowd Externalities

no code implementations • 29 Sep 2021 • Romain Laroche, Othmane Safsafi, Raphael Feraud, Nicolas Broutin

In Batched Multi-Armed Bandits (BMAB), the policy is not allowed to be updated at each time step.

Paper
Add Code

Context Attentive Bandits: Contextual Bandit with Restricted Context

no code implementations • 10 May 2017 • Djallel Bouneffouf, Irina Rish, Guillermo A. Cecchi, Raphael Feraud

We consider a novel formulation of the multi-armed bandit model, which we call the contextual bandit with restricted context, where only a limited number of features can be accessed by the learner at every iteration.

Recommendation Systems Thompson Sampling

Paper
Add Code

Reinforcement Learning Algorithm Selection

no code implementations • ICLR 2018 • Romain Laroche, Raphael Feraud

This paper formalises the problem of online algorithm selection in the context of Reinforcement Learning.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

A Neural Networks Committee for the Contextual Bandit Problem

no code implementations • 29 Sep 2014 • Robin Allesiardo, Raphael Feraud, Djallel Bouneffouf

This paper presents a new contextual bandit algorithm, NeuralBandit, which does not need hypothesis on stationarity of contexts and rewards.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.