no code implementations • 14 Feb 2024 • Ilja Kuzborskij, Kwang-Sung Jun, Yulian Wu, Kyoungseok Jang, Francesco Orabona
In this paper, we consider the problem of proving concentration inequalities to estimate the mean of the sequence.
no code implementations • 1 Jun 2023 • Yulian Wu, Xingyu Zhou, Sayak Ray Chowdhury, Di Wang
Under each framework, we consider both joint differential privacy (JDP) and local differential privacy (LDP) models.
no code implementations • 16 Feb 2023 • Bhargav Ganguly, Yulian Wu, Di Wang, Vaneet Aggarwal
This improvement is a key to the significant regret improvement in quantum reinforcement learning.
no code implementations • 23 Jan 2023 • Yulian Wu, Chaowen Guan, Vaneet Aggarwal, Di Wang
In this paper, we study multi-armed bandits (MAB) and stochastic linear bandits (SLB) with heavy-tailed rewards and quantum reward oracle.
no code implementations • 4 Jun 2021 • Youming Tao, Yulian Wu, Peng Zhao, Di Wang
Finally, we establish the lower bound to show that the instance-dependent regret of our improved algorithm is optimal.