NeurIPS 2017 • Ervin Tanczos, Robert Nowak, Bob Mankoff
This paper focuses on best-arm identification in multi-armed bandits with bounded rewards.
Multi-Armed Bandits