Search Results for author: Huasen Wu

Found 6 papers, 1 papers with code

Should I send this notification? Optimizing push notifications decision making by modeling the future

no code implementations • 17 Feb 2022 • Conor O'Brien, Huasen Wu, Shaodan Zhai, Dalin Guo, Wenzhe Shi, Jonathan J Hunt

In this work we focus on mobile push notifications, where the long term effects of recommender system decisions can be particularly strong.

Decision Making Model-based Reinforcement Learning +2

Paper
Add Code

Learning to Rank For Push Notifications Using Pairwise Expected Regret

no code implementations • 19 Jan 2022 • Yuguang Yue, Yuanpu Xie, Huasen Wu, Haofeng Jia, Shaodan Zhai, Wenzhe Shi, Jonathan J Hunt

Listwise ranking losses have been widely studied in recommender systems.

Learning-To-Rank Recommendation Systems

Paper
Add Code

Waiting but not Aging: Optimizing Information Freshness Under the Pull Model

no code implementations • 17 Dec 2019 • Fengjiao Li, Yu Sang, Zhongdong Liu, Bin Li, Huasen Wu, Bo Ji

Interestingly, we find that under this new Pull model, replication schemes capture a novel tradeoff between different values of the AoI across the servers (due to the random updating processes) and different response times across the servers, which can be exploited to minimize the expected AoI at the user's side.

Paper
Add Code

Adaptive Exploration-Exploitation Tradeoff for Opportunistic Bandits

no code implementations • ICML 2018 • Huasen Wu, Xueying Guo, Xin Liu

When the load/price is low, so is the cost/regret of pulling a suboptimal arm (e. g., trying a suboptimal network configuration).

Thompson Sampling

Paper
Add Code

Double Thompson Sampling for Dueling Bandits

1 code implementation • NeurIPS 2016 • Huasen Wu, Xin Liu

This simple algorithm applies to general Copeland dueling bandits, including Condorcet dueling bandits as its special case.

Thompson Sampling

Paper
Code

Algorithms with Logarithmic or Sublinear Regret for Constrained Contextual Bandits

no code implementations • NeurIPS 2015 • Huasen Wu, R. Srikant, Xin Liu, Chong Jiang

To the best of our knowledge, this is the first work that shows how to achieve logarithmic regret in constrained contextual bandits.

Multi-Armed Bandits

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.