1 code implementation • 15 Feb 2024 • Chengshuai Shi, Kun Yang, Jing Yang, Cong Shen
Based on this connection, a general framework TRIPLE (besT aRm Identification for Prompt LEarning) is proposed to harness the power of BAI-FB in prompt learning systematically.
no code implementations • 26 Dec 2023 • Chengshuai Shi, Ruida Zhou, Kun Yang, Cong Shen
Federated learning (FL) has demonstrated great potential in revolutionizing distributed machine learning, and tremendous efforts have been made to extend it beyond the original focus on supervised learning.
no code implementations • 14 Jun 2023 • Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang
Then, a novel HetPEVI algorithm is proposed, which simultaneously considers the sample uncertainties from a finite number of data samples per data source and the source uncertainties due to a finite number of available data sources.
no code implementations • 6 May 2023 • Chengshuai Shi, Cong Shen, Nicholas D. Sidiropoulos
To address this limitation, this work studies a general tensor bandits model, where actions and system parameters are represented by tensors as opposed to vectors, and we particularly focus on the case that the unknown system tensor is low-rank.
no code implementations • 3 May 2023 • Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang
Rigorous analyses demonstrate that when facing clients with UCB1, TWL outperforms TAL in terms of the dependencies on sub-optimality gaps thanks to its adaptive design.
no code implementations • 4 Oct 2022 • Wei Xiong, Han Zhong, Chengshuai Shi, Cong Shen, Tong Zhang
Existing studies on provably efficient algorithms for Markov games (MGs) almost exclusively build on the "optimism in the face of uncertainty" (OFU) principle.
no code implementations • 31 May 2022 • Wei Xiong, Han Zhong, Chengshuai Shi, Cong Shen, LiWei Wang, Tong Zhang
We also extend our techniques to the two-player zero-sum Markov games (MGs), and establish a new performance lower bound for MGs, which tightens the existing result, and verifies the nearly minimax optimality of the proposed algorithm.
1 code implementation • NeurIPS 2021 • Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang
In this paper, we propose BEACON -- Batched Exploration with Adaptive COmmunicatioN -- that closes this gap.
1 code implementation • NeurIPS 2021 • Chengshuai Shi, Haifeng Xu, Wei Xiong, Cong Shen
In this work, we break this barrier and study incentivized exploration with multiple and long-term strategic agents, who have more complicated behaviors that often appear in real-world applications.
no code implementations • 25 Jun 2021 • Chengshuai Shi, Cong Shen
We study a new stochastic multi-player multi-armed bandits (MP-MAB) problem, where the reward distribution changes if a collision occurs on the arm.
1 code implementation • 25 Feb 2021 • Chengshuai Shi, Cong Shen, Jing Yang
A general framework of personalized federated multi-armed bandits (PF-MAB) is proposed, which is a new bandit paradigm analogous to the federated learning (FL) framework in supervised learning and enjoys the features of FL with personalization.
1 code implementation • 28 Jan 2021 • Chengshuai Shi, Cong Shen
We first study the approximate model where the heterogeneous local models are random realizations of the global model from an unknown distribution.
no code implementations • 2 Nov 2020 • Chengshuai Shi, Cong Shen
Instead of focusing on the hardness of multiple players, we introduce a new dimension of hardness, called attackability.
no code implementations • 29 Feb 2020 • Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang
The decentralized stochastic multi-player multi-armed bandit (MP-MAB) problem, where the collision information is not available to the players, is studied in this paper.