no code implementations • 30 Apr 2023 • Fathima Zarin Faizal, Adway Girish, Manjesh Kumar Hanawal, Nikhil Karamchandani
We study the problem of best-arm identification in a distributed variant of the multi-armed bandit setting, with a central learner and multiple agents.
no code implementations • 27 Nov 2022 • Fathima Zarin Faizal, Jayakrishnan Nair
A key feature of this algorithm is that it is designed on the basis of an information-theoretic lower bound for two-armed instances.