no code implementations • 23 Jan 2023 • Yulian Wu, Chaowen Guan, Vaneet Aggarwal, Di Wang
In this paper, we study multi-armed bandits (MAB) and stochastic linear bandits (SLB) with heavy-tailed rewards and quantum reward oracle.
no code implementations • 19 Oct 2020 • Di Wang, Xiangyu Guo, Chaowen Guan, Shi Li, Jinhui Xu
To the best of our knowledge, this is the first work that studies and provides theoretical guarantees for the stochastic linear combination of non-linear regressions model.