Search Results for author: Chengshuai Shi

Found 14 papers, 5 papers with code

Best Arm Identification for Prompt Learning under a Limited Budget

1 code implementation15 Feb 2024 Chengshuai Shi, Kun Yang, Jing Yang, Cong Shen

Based on this connection, a general framework TRIPLE (besT aRm Identification for Prompt LEarning) is proposed to harness the power of BAI-FB in prompt learning systematically.

Instruction Following Multi-Armed Bandits

Harnessing the Power of Federated Learning in Federated Contextual Bandits

no code implementations26 Dec 2023 Chengshuai Shi, Ruida Zhou, Kun Yang, Cong Shen

Federated learning (FL) has demonstrated great potential in revolutionizing distributed machine learning, and tremendous efforts have been made to extend it beyond the original focus on supervised learning.

Decision Making Federated Learning +1

Provably Efficient Offline Reinforcement Learning with Perturbed Data Sources

no code implementations14 Jun 2023 Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang

Then, a novel HetPEVI algorithm is proposed, which simultaneously considers the sample uncertainties from a finite number of data samples per data source and the source uncertainties due to a finite number of available data sources.

Offline RL reinforcement-learning +1

On High-dimensional and Low-rank Tensor Bandits

no code implementations6 May 2023 Chengshuai Shi, Cong Shen, Nicholas D. Sidiropoulos

To address this limitation, this work studies a general tensor bandits model, where actions and system parameters are represented by tensors as opposed to vectors, and we particularly focus on the case that the unknown system tensor is low-rank.

Recommendation Systems Vocal Bursts Intensity Prediction

Reward Teaching for Federated Multi-armed Bandits

no code implementations3 May 2023 Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang

Rigorous analyses demonstrate that when facing clients with UCB1, TWL outperforms TAL in terms of the dependencies on sub-optimality gaps thanks to its adaptive design.

Multi-Armed Bandits

A Self-Play Posterior Sampling Algorithm for Zero-Sum Markov Games

no code implementations4 Oct 2022 Wei Xiong, Han Zhong, Chengshuai Shi, Cong Shen, Tong Zhang

Existing studies on provably efficient algorithms for Markov games (MGs) almost exclusively build on the "optimism in the face of uncertainty" (OFU) principle.

Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game

no code implementations31 May 2022 Wei Xiong, Han Zhong, Chengshuai Shi, Cong Shen, LiWei Wang, Tong Zhang

We also extend our techniques to the two-player zero-sum Markov games (MGs), and establish a new performance lower bound for MGs, which tightens the existing result, and verifies the nearly minimax optimality of the proposed algorithm.

Offline RL Reinforcement Learning (RL)

(Almost) Free Incentivized Exploration from Decentralized Learning Agents

1 code implementation NeurIPS 2021 Chengshuai Shi, Haifeng Xu, Wei Xiong, Cong Shen

In this work, we break this barrier and study incentivized exploration with multiple and long-term strategic agents, who have more complicated behaviors that often appear in real-world applications.

Multi-Armed Bandits

Multi-player Multi-armed Bandits with Collision-Dependent Reward Distributions

no code implementations25 Jun 2021 Chengshuai Shi, Cong Shen

We study a new stochastic multi-player multi-armed bandits (MP-MAB) problem, where the reward distribution changes if a collision occurs on the arm.

Multi-Armed Bandits

Federated Multi-armed Bandits with Personalization

1 code implementation25 Feb 2021 Chengshuai Shi, Cong Shen, Jing Yang

A general framework of personalized federated multi-armed bandits (PF-MAB) is proposed, which is a new bandit paradigm analogous to the federated learning (FL) framework in supervised learning and enjoys the features of FL with personalization.

Federated Learning Multi-Armed Bandits

Federated Multi-Armed Bandits

1 code implementation28 Jan 2021 Chengshuai Shi, Cong Shen

We first study the approximate model where the heterogeneous local models are random realizations of the global model from an unknown distribution.

Federated Learning Multi-Armed Bandits +1

On No-Sensing Adversarial Multi-player Multi-armed Bandits with Collision Communications

no code implementations2 Nov 2020 Chengshuai Shi, Cong Shen

Instead of focusing on the hardness of multiple players, we introduce a new dimension of hardness, called attackability.

Multi-Armed Bandits

Decentralized Multi-player Multi-armed Bandits with No Collision Information

no code implementations29 Feb 2020 Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang

The decentralized stochastic multi-player multi-armed bandit (MP-MAB) problem, where the collision information is not available to the players, is studied in this paper.

Multi-Armed Bandits

Cannot find the paper you are looking for? You can Submit a new open access paper.