no code implementations • 3 Mar 2020 • Peter Landgren, Vaibhav Srivastava, Naomi Ehrich Leonard
We consider a constrained reward model in which agents that choose the same arm at the same time receive no reward.
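The collision rule in this snippet can be illustrated with a minimal sketch. It assumes each agent picks one arm per time step and that a reward is zeroed whenever two or more agents pick the same arm. The function name `collision_rewards` and its arguments are hypothetical and are not taken from the paper.

```python
from collections import Counter


def collision_rewards(choices, raw_rewards):
    """Apply a collision rule for one time step (illustrative assumption).

    choices[i]     -- index of the arm chosen by agent i
    raw_rewards[i] -- reward agent i would receive if it were alone on that arm
    Returns the rewards actually received: 0.0 for every agent that shares
    its arm with at least one other agent, the raw reward otherwise.
    """
    counts = Counter(choices)  # number of agents on each arm
    return [
        reward if counts[arm] == 1 else 0.0
        for arm, reward in zip(choices, raw_rewards)
    ]


# Agents 0 and 2 both choose arm 0, so both receive nothing.
print(collision_rewards([0, 1, 0], [1.0, 2.0, 3.0]))  # [0.0, 2.0, 0.0]
```

Under this rule an agent benefits from avoiding arms its neighbors select, which is why the setting is described as constrained.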
no code implementations • 2 Jun 2016 • Peter Landgren, Vaibhav Srivastava, Naomi Ehrich Leonard
We study distributed cooperative decision-making under the explore-exploit tradeoff in the multiarmed bandit (MAB) problem.
no code implementations • 21 Dec 2015 • Peter Landgren, Vaibhav Srivastava, Naomi Ehrich Leonard
We study the explore-exploit tradeoff in distributed cooperative decision-making using the context of the multiarmed bandit (MAB) problem.