Search Results for author: Weina Wang

Found 8 papers, 1 paper with code

When is exponential asymptotic optimality achievable in average-reward restless bandits?

no code implementations • 28 May 2024 • Yige Hong, Qiaomin Xie, Yudong Chen, Weina Wang

We show that our policy is asymptotically optimal with an $O(\exp(-C N))$ optimality gap for an $N$-armed problem, under the mild assumptions of aperiodic-unichain, non-degeneracy, and local stability.

Efficient Reinforcement Learning for Routing Jobs in Heterogeneous Queueing Systems

no code implementations • 2 Feb 2024 • Neharika Jali, Guannan Qu, Weina Wang, Gauri Joshi

Unlike in homogeneous systems, a threshold policy that routes jobs to the slow server(s) when the queue length exceeds a certain threshold is known to be optimal for the one-fast-one-slow two-server system.

reinforcement-learning Reinforcement Learning (RL)

Job Dispatching Policies for Queueing Systems with Unknown Service Rates

no code implementations • 8 Jun 2021 • Tuhinangshu Choudhury, Gauri Joshi, Weina Wang, Sanjay Shakkottai

In multi-server queueing systems where there is no central queue holding all incoming jobs, job dispatching policies are used to assign incoming jobs to the queue at one of the servers.

On the Privacy-Utility Tradeoff in Peer-Review Data Analysis

no code implementations • 29 Jun 2020 • Wenxin Ding, Nihar B. Shah, Weina Wang

The crux of the framework lies in recognizing that part of the data pertaining to the reviews is already publicly available, and we use this information to post-process the data released by any privacy mechanism in a manner that improves the accuracy (utility) of the data while retaining the privacy guarantees.

Privacy Preserving

QuickStop: A Markov Optimal Stopping Approach for Quickest Misinformation Detection

no code implementations • 4 Mar 2019 • Honghao Wei, Xiaohan Kang, Weina Wang, Lei Ying

The algorithm consists of an offline machine learning algorithm for learning the probabilistic information spreading model and an online optimal stopping algorithm to detect misinformation.

Misinformation

Almost Boltzmann Exploration

no code implementations • 25 Jan 2019 • Harsh Gupta, Seo Taek Kong, R. Srikant, Weina Wang

In this paper, we show that a simple modification to Boltzmann exploration, motivated by a variation of the standard doubling trick, achieves $O(K\log^{1+\alpha} T)$ regret for a stochastic MAB problem with $K$ arms, where $\alpha>0$ is a parameter of the algorithm.

Multi-Armed Bandits
