Search Results for author: Mark Moldwin

Found 1 papers, 0 papers with code

Simple Regret Minimization for Contextual Bandits

no code implementations17 Oct 2018 Aniket Anand Deshmukh, Srinagesh Sharma, James W. Cutler, Mark Moldwin, Clayton Scott

Contextual bandits are a sub-class of MABs where, at every time step, the learner has access to side information that is predictive of the best arm.

Multi-Armed Bandits

Cannot find the paper you are looking for? You can Submit a new open access paper.