Search Results for author: Paul Reverdy

Found 4 papers, 0 papers with code

Satisficing in multi-armed bandit problems

no code implementations • 23 Dec 2015 • Paul Reverdy, Vaibhav Srivastava, Naomi Ehrich Leonard

Satisficing is a relaxation of maximizing and allows for less risky decision making in the face of uncertainty.

Paper
Add Code

Correlated Multiarmed Bandit Problem: Bayesian Algorithms and Regret Analysis

no code implementations • 5 Jul 2015 • Vaibhav Srivastava, Paul Reverdy, Naomi Ehrich Leonard

We consider the correlated multiarmed bandit (MAB) problem in which the rewards associated with each arm are modeled by a multivariate Gaussian random variable, and we investigate the influence of the assumptions in the Bayesian prior on the performance of the upper credible limit (UCL) algorithm and a new correlated UCL algorithm.

Decision Making

Paper
Add Code

Parameter estimation in softmax decision-making models with linear objective functions

no code implementations • 16 Feb 2015 • Paul Reverdy, Naomi E. Leonard

With an eye towards human-centered automation, we contribute to the development of a systematic means to infer features of human decision-making from behavioral data.

Decision Making

Paper
Add Code

Modeling Human Decision-making in Generalized Gaussian Multi-armed Bandits

no code implementations • 23 Jul 2013 • Paul Reverdy, Vaibhav Srivastava, Naomi E. Leonard

We develop the upper credible limit (UCL) algorithm for the standard multi-armed bandit problem and show that this deterministic algorithm achieves logarithmic cumulative expected regret, which is optimal performance for uninformative priors.

Bayesian Inference Decision Making +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.