no code implementations • 29 Jun 2023 • Akanksha Sneh, Sumit Darak, Shobha Sundar Ram, Manjesh Hanawal
Multi-arm bandit (MAB) algorithms have been used to learn optimal beams for millimeter wave communication systems.
no code implementations • 18 Oct 2016 • Manjesh Hanawal, Csaba Szepesvari, Venkatesh Saligrama
We reduce USS to a special case of multi-armed bandit problem with side information and develop polynomial time algorithms that achieve sublinear regret.