no code implementations • 30 May 2023 • Ronshee Chawla, Daniel Vial, Sanjay Shakkottai, R. Srikant
The study of collaborative multi-agent bandits has attracted significant attention recently.
no code implementations • 2 Jul 2020 • Ronshee Chawla, Abishek Sankararaman, Sanjay Shakkottai
We study a multi-agent stochastic linear bandit with side information, parameterized by an unknown vector $\theta^* \in \mathbb{R}^d$.
no code implementations • 15 Jan 2020 • Ronshee Chawla, Abishek Sankararaman, Ayalvadi Ganesh, Sanjay Shakkottai
Agents use the communication medium to recommend only arm-IDs (not samples), and thus update the set of arms from which they play.