Search Results for author: Chengshuai Shi

Found 14 papers, 5 papers with code

Best Arm Identification for Prompt Learning under a Limited Budget

1 code implementation • 15 Feb 2024 • Chengshuai Shi, Kun Yang, Jing Yang, Cong Shen

Based on this connection, a general framework TRIPLE (besT aRm Identification for Prompt LEarning) is proposed to harness the power of BAI-FB in prompt learning systematically.

Instruction Following Multi-Armed Bandits

Paper
Code

Harnessing the Power of Federated Learning in Federated Contextual Bandits

no code implementations • 26 Dec 2023 • Chengshuai Shi, Ruida Zhou, Kun Yang, Cong Shen

Federated learning (FL) has demonstrated great potential in revolutionizing distributed machine learning, and tremendous efforts have been made to extend it beyond the original focus on supervised learning.

Decision Making Federated Learning +1

Paper
Add Code

Provably Efficient Offline Reinforcement Learning with Perturbed Data Sources

no code implementations • 14 Jun 2023 • Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang

Then, a novel HetPEVI algorithm is proposed, which simultaneously considers the sample uncertainties from a finite number of data samples per data source and the source uncertainties due to a finite number of available data sources.

Offline RL reinforcement-learning +1

Paper
Add Code

On High-dimensional and Low-rank Tensor Bandits

no code implementations • 6 May 2023 • Chengshuai Shi, Cong Shen, Nicholas D. Sidiropoulos

To address this limitation, this work studies a general tensor bandits model, where actions and system parameters are represented by tensors as opposed to vectors, and we particularly focus on the case that the unknown system tensor is low-rank.

Recommendation Systems Vocal Bursts Intensity Prediction

Paper
Add Code

Reward Teaching for Federated Multi-armed Bandits

no code implementations • 3 May 2023 • Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang

Rigorous analyses demonstrate that when facing clients with UCB1, TWL outperforms TAL in terms of the dependencies on sub-optimality gaps thanks to its adaptive design.

Multi-Armed Bandits

Paper
Add Code

A Self-Play Posterior Sampling Algorithm for Zero-Sum Markov Games

no code implementations • 4 Oct 2022 • Wei Xiong, Han Zhong, Chengshuai Shi, Cong Shen, Tong Zhang

Existing studies on provably efficient algorithms for Markov games (MGs) almost exclusively build on the "optimism in the face of uncertainty" (OFU) principle.

Paper
Add Code

Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game

no code implementations • 31 May 2022 • Wei Xiong, Han Zhong, Chengshuai Shi, Cong Shen, LiWei Wang, Tong Zhang

We also extend our techniques to the two-player zero-sum Markov games (MGs), and establish a new performance lower bound for MGs, which tightens the existing result, and verifies the nearly minimax optimality of the proposed algorithm.

Offline RL Reinforcement Learning (RL)

Paper
Add Code

Heterogeneous Multi-player Multi-armed Bandits: Closing the Gap and Generalization

1 code implementation • NeurIPS 2021 • Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang

In this paper, we propose BEACON -- Batched Exploration with Adaptive COmmunicatioN -- that closes this gap.

Efficient Exploration Multi-Armed Bandits

Paper
Code

(Almost) Free Incentivized Exploration from Decentralized Learning Agents

1 code implementation • NeurIPS 2021 • Chengshuai Shi, Haifeng Xu, Wei Xiong, Cong Shen

In this work, we break this barrier and study incentivized exploration with multiple and long-term strategic agents, who have more complicated behaviors that often appear in real-world applications.

Multi-Armed Bandits

Paper
Code

Multi-player Multi-armed Bandits with Collision-Dependent Reward Distributions

no code implementations • 25 Jun 2021 • Chengshuai Shi, Cong Shen

We study a new stochastic multi-player multi-armed bandits (MP-MAB) problem, where the reward distribution changes if a collision occurs on the arm.

Multi-Armed Bandits

Paper
Add Code

Federated Multi-armed Bandits with Personalization

1 code implementation • 25 Feb 2021 • Chengshuai Shi, Cong Shen, Jing Yang

A general framework of personalized federated multi-armed bandits (PF-MAB) is proposed, which is a new bandit paradigm analogous to the federated learning (FL) framework in supervised learning and enjoys the features of FL with personalization.

Federated Learning Multi-Armed Bandits

Paper
Code

Federated Multi-Armed Bandits

1 code implementation • 28 Jan 2021 • Chengshuai Shi, Cong Shen

We first study the approximate model where the heterogeneous local models are random realizations of the global model from an unknown distribution.

Federated Learning Multi-Armed Bandits +1

Paper
Code

On No-Sensing Adversarial Multi-player Multi-armed Bandits with Collision Communications

no code implementations • 2 Nov 2020 • Chengshuai Shi, Cong Shen

Instead of focusing on the hardness of multiple players, we introduce a new dimension of hardness, called attackability.

Multi-Armed Bandits

Paper
Add Code

Decentralized Multi-player Multi-armed Bandits with No Collision Information

no code implementations • 29 Feb 2020 • Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang

The decentralized stochastic multi-player multi-armed bandit (MP-MAB) problem, where the collision information is not available to the players, is studied in this paper.

Multi-Armed Bandits

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.