Search Results for author: Mark Moldwin

Simple Regret Minimization for Contextual Bandits

Contextual bandits are a sub-class of MABs where, at every time step, the learner has access to side information that is predictive of the best arm.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.