Search Results for author: Junyu Cao

Found 5 papers, 1 papers with code

A Probabilistic Approach for Alignment with Human Comparisons

no code implementations • 16 Mar 2024 • Junyu Cao, Mohsen Bayati

To bridge this gap, this paper studies the effective use of human comparisons to address limitations arising from noisy data and high-dimensional models.

Paper
Add Code

Speed Up the Cold-Start Learning in Two-Sided Bandits with Many Arms

no code implementations • 1 Oct 2022 • Mohsen Bayati, Junyu Cao, Wanning Chen

Next, we design two-phase bandit algorithms that first use subsampling and low-rank matrix estimation to obtain a substantially smaller targeted set of products and then apply a UCB procedure on the target products to find the best one.

Paper
Add Code

Fatigue-aware Bandits for Dependent Click Models

no code implementations • 22 Aug 2020 • Junyu Cao, Wei Sun, Zuo-Jun, Shen, Markus Ettl

Based on user's feedback, the platform learns the relevance of the underlying content as well as the discounting effect due to content fatigue.

Recommendation Systems

Paper
Add Code

Dynamic Learning with Frequent New Product Launches: A Sequential Multinomial Logit Bandit Problem

no code implementations • 29 Apr 2019 • Junyu Cao, Wei Sun

Motivated by the phenomenon that companies introduce new products to keep abreast with customers' rapidly changing tastes, we consider a novel online learning setting where a profit-maximizing seller needs to learn customers' preferences through offering recommendations, which may contain existing products and new products that are launched in the middle of a selling period.

Product Recommendation

Paper
Add Code

Dynamic Learning of Sequential Choice Bandit Problem under Marketing Fatigue

1 code implementation • 19 Mar 2019 • Junyu Cao, Wei Sun

Based on user feedback, the platform dynamically learns users' abandonment distribution and their valuations of messages to determine the length of the sequence and the order of the messages, while maximizing the cumulative payoff over a horizon of length T. We refer to this online learning task as the sequential choice bandit problem.

Combinatorial Optimization Marketing

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.