no code implementations • 17 Oct 2018 • Aniket Anand Deshmukh, Srinagesh Sharma, James W. Cutler, Mark Moldwin, Clayton Scott
Contextual bandits are a sub-class of MABs where, at every time step, the learner has access to side information that is predictive of the best arm.