1 code implementation • 14 Jul 2020 • Daochen Wang, Xuchen You, Tongyang Li, Andrew M. Childs
Identifying the best arm of a multi-armed bandit is a central problem in bandit optimization.
Multi-Armed Bandits