no code implementations • 29 Feb 2024 • Ethan Blaser, Chuanhao Li, Hongning Wang
The demand for collaborative and private bandit learning across multiple agents is surging due to the growing quantity of data generated from distributed systems.
no code implementations • 7 Feb 2024 • Zhepei Wei, Chuanhao Li, Tianze Ren, Haifeng Xu, Hongning Wang
To enhance the efficiency and practicality of federated bandit learning, recent advances have introduced incentives to motivate communication among clients, where a client participates only when the incentive offered by the server outweighs its participation cost.
no code implementations • 3 Nov 2023 • Chuanhao Li, Chong Liu, Yu-Xiang Wang
Federated optimization studies the problem of collaborative function optimization among multiple clients (e. g. mobile devices or organizations) under the coordination of a central server.
no code implementations • 17 Oct 2023 • Zichen Wang, Chuanhao Li, Chenyu Song, Lianghui Wang, Quanquan Gu, Huazheng Wang
We study the federated pure exploration problem of multi-armed bandits and linear bandits, where $M$ agents cooperatively identify the best arm via communicating with the central server.
no code implementations • 3 Feb 2023 • Fan Yao, Chuanhao Li, Denis Nekipelov, Hongning Wang, Haifeng Xu
Content creators compete for exposure on recommendation platforms, and such strategic behavior leads to a dynamic shift over the content distribution.
1 code implementation • CVPR 2023 • Chuanhao Li, Zhen Li, Chenchen Jing, Yunde Jia, Yuwei Wu
Compositional generalization is critical to simulate the compositional capability of humans, and has received much attention in the vision-and-language (V&L) community.
no code implementations • 10 Jun 2022 • Chuanhao Li, Huazheng Wang, Mengdi Wang, Hongning Wang
We tackle the communication efficiency challenge of learning kernelized contextual bandits in a distributed setting.
no code implementations • 3 Feb 2022 • Fan Yao, Chuanhao Li, Denis Nekipelov, Hongning Wang, Haifeng Xu
In real-world recommendation problems, especially those with a formidably large item space, users have to gradually learn to estimate the utility of any fresh recommendations from their experience about previously consumed items.
no code implementations • 2 Feb 2022 • Chuanhao Li, Hongning Wang
Contextual bandit algorithms have been recently studied under the federated learning setting to satisfy the demand of keeping data decentralized and pushing the learning of bandit models to the client side.
no code implementations • 6 Oct 2021 • Fan Yao, Chuanhao Li, Denis Nekipelov, Hongning Wang, Haifeng Xu
We propose a new problem setting to study the sequential interactions between a recommender system and a user.
no code implementations • 4 Oct 2021 • Chuanhao Li, Hongning Wang
In this paper, we study linear contextual bandit in a federated learning setting.
no code implementations • 14 Apr 2021 • Chuanhao Li, Qingyun Wu, Hongning Wang
However, all existing collaborative bandit learning solutions impose a stationary assumption about the environment, i. e., both user preferences and the dependency among users are assumed static over time.
no code implementations • 8 Apr 2021 • Huazheng Wang, Haifeng Xu, Chuanhao Li, Zhiyuan Liu, Hongning Wang
We study the problem of incentivizing exploration for myopic users in linear bandits, where the users tend to exploit arm with the highest predicted reward instead of exploring.
no code implementations • 5 Sep 2020 • Chuanhao Li, Qingyun Wu, Hongning Wang
Non-stationary bandits and online clustering of bandits lift the restrictive assumptions in contextual bandits and provide solutions to many important real-world scenarios.