1 code implementation • IEEE Conference on Computer Communications 2021 • Guoju Gao, He Huang, Mingjun Xiao, Jie Wu, Yu-E Sun, Sheng Zhang
The multi-armed bandit (MAB) model has been deeply studied to solve many online learning problems, such as rate allocation in communication networks, Ad recommendation in social networks, etc.