Search Results for author: Guojun Xiong

Found 19 papers, 1 papers with code

Finite-Horizon Single-Pull Restless Bandits: An Efficient Index Policy For Scarce Resource Allocation

no code implementations10 Jan 2025 Guojun Xiong, Haichuan Wang, Yuqi Pan, Saptarshi Mandal, Sanket Shah, Niclas Boehmer, Milind Tambe

However, in many practical settings with highly scarce resources, where each agent can only receive at most one resource, such as healthcare intervention programs, the standard RMAB framework falls short.

Multi-Armed Bandits

INVESTORBENCH: A Benchmark for Financial Decision-Making Tasks with LLM-based Agent

no code implementations24 Dec 2024 Haohang Li, Yupeng Cao, Yangyang Yu, Shashidhar Reddy Javaji, Zhiyang Deng, Yueru He, Yuechen Jiang, Zining Zhu, Koduvayur Subbalakshmi, Guojun Xiong, Jimin Huang, Lingfei Qian, Xueqing Peng, Qianqian Xie, Jordan W. Suchow

Despite this progress, the field currently encounters two main challenges: (1) the lack of a comprehensive LLM agent framework adaptable to a variety of financial tasks, and (2) the absence of standardized benchmarks and consistent datasets for assessing agent performance.

Decision Making Language Modeling +2

On the Linear Speedup of Personalized Federated Reinforcement Learning with Shared Representations

no code implementations22 Nov 2024 Guojun Xiong, Shufan Wang, Daniel Jiang, Jian Li

In this paper, we take a further step and introduce a \emph{personalized} FedRL framework (PFedRL) by taking advantage of possibly shared common structure among agents in heterogeneous environments.

DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback

no code implementations7 Oct 2024 Guojun Xiong, Ujwal Dinesha, Debajoy Mukherjee, Jian Li, Srinivas Shakkottai

In this paper, we introduce Pref-RMAB, a new RMAB model in the presence of preference signals, where the decision maker only observes pairwise preference feedback rather than scalar reward from the activated arms at each decision epoch.

Multi-Armed Bandits Sequential Decision Making

Decentralized Federated Learning with Model Caching on Mobile Agents

no code implementations26 Aug 2024 Xiaoyu Wang, Guojun Xiong, Houwei Cao, Jian Li, Yong liu

Decentralized FL (DFL) utilizes local model exchange and aggregation between agents to reduce the communication and computation overheads on the central server.

Federated Learning

Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback

no code implementations2 May 2024 Guojun Xiong, Jian Li

Restless multi-armed bandits (RMAB) play a central role in modeling sequential decision making problems under an instantaneous activation constraint that at most B arms can be activated at any decision epoch.

Multi-Armed Bandits Sequential Decision Making

Structured Reinforcement Learning for Delay-Optimal Data Transmission in Dense mmWave Networks

no code implementations25 Apr 2024 Shufan Wang, Guojun Xiong, Shichen Zhang, Huacheng Zeng, Jian Li, Shivendra Panwar

We study the data packet transmission problem (mmDPT) in dense cell-free millimeter wave (mmWave) networks, i. e., users sending data packet requests to access points (APs) via uplinks and APs transmitting requested data packets to users via downlinks.

Fairness Multi-Armed Bandits +1

FinBen: A Holistic Financial Benchmark for Large Language Models

2 code implementations20 Feb 2024 Qianqian Xie, Weiguang Han, Zhengyu Chen, Ruoyu Xiang, Xiao Zhang, Yueru He, Mengxi Xiao, Dong Li, Yongfu Dai, Duanyu Feng, Yijing Xu, Haoqiang Kang, Ziyan Kuang, Chenhan Yuan, Kailai Yang, Zheheng Luo, Tianlin Zhang, Zhiwei Liu, Guojun Xiong, Zhiyang Deng, Yuechen Jiang, Zhiyuan Yao, Haohang Li, Yangyang Yu, Gang Hu, Jiajia Huang, Xiao-Yang Liu, Alejandro Lopez-Lira, Benyou Wang, Yanzhao Lai, Hao Wang, Min Peng, Sophia Ananiadou, Jimin Huang

Our evaluation of 15 representative LLMs, including GPT-4, ChatGPT, and the latest Gemini, reveals several key findings: While LLMs excel in IE and textual analysis, they struggle with advanced reasoning and complex tasks like text generation and forecasting.

Question Answering RAG +2

DePRL: Achieving Linear Convergence Speedup in Personalized Decentralized Learning with Shared Representations

no code implementations17 Dec 2023 Guojun Xiong, Gang Yan, Shiqiang Wang, Jian Li

Decentralized learning has emerged as an alternative method to the popular parameter-server framework which suffers from high communication burden, single-point failure and scalability issues due to the need of a central server.

Learning Theory Representation Learning

Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints

no code implementations16 Dec 2023 Shufan Wang, Guojun Xiong, Jian Li

Restless multi-armed bandits (RMAB) have been widely used to model sequential decision making problems with constraints.

Decision Making Fairness +3

Straggler-Resilient Decentralized Learning via Adaptive Asynchronous Updates

no code implementations11 Jun 2023 Guojun Xiong, Gang Yan, Shiqiang Wang, Jian Li

With the increasing demand for large-scale training of machine learning models, fully decentralized optimization methods have recently been advocated as alternatives to the popular parameter server framework.

Decentralized Stochastic Multi-Player Multi-Armed Walking Bandits

no code implementations12 Dec 2022 Guojun Xiong, Jian Li

Most research for this problem focuses exclusively on the settings that players have \textit{full access} to all arms and receive no reward when pulling the same arm.

Decision Making Distributed Optimization

Whittle Index based Q-Learning for Wireless Edge Caching with Linear Function Approximation

no code implementations26 Feb 2022 Guojun Xiong, Shufan Wang, Jian Li, Rahul Singh

Using this structural result, we establish the indexability of our problem, and employ the Whittle index policy to minimize average latency.

Edge-computing Q-Learning +1

Straggler-Resilient Distributed Machine Learning with Dynamic Backup Workers

no code implementations11 Feb 2021 Guojun Xiong, Gang Yan, Rahul Singh, Jian Li

In this paradigm, each worker maintains a local estimate of the optimal parameter vector, and iteratively updates it by waiting and averaging all estimates obtained from its neighbors, and then corrects it on the basis of its local dataset.

BIG-bench Machine Learning Distributed Optimization

Learning Augmented Index Policy for Optimal Service Placement at the Network Edge

no code implementations10 Jan 2021 Guojun Xiong, Rahul Singh, Jian Li

We pose the problem as a Markov decision process (MDP) in which the system state is given by describing, for each service, the number of customers that are currently waiting at the edge to obtain the service.

Q-Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.