no code implementations • 16 Feb 2015 • Paul Reverdy, Naomi E. Leonard
With an eye towards human-centered automation, we contribute to the development of a systematic means to infer features of human decision-making from behavioral data.
no code implementations • 23 Jul 2013 • Paul Reverdy, Vaibhav Srivastava, Naomi E. Leonard
We develop the upper credible limit (UCL) algorithm for the standard multi-armed bandit problem and show that this deterministic algorithm achieves logarithmic cumulative expected regret, which is optimal performance for uninformative priors.