Search Results for author: Alex Nikulkov

Found 4 papers, 1 papers with code

Pearl: A Production-ready Reinforcement Learning Agent

1 code implementation • 6 Dec 2023 • Zheqing Zhu, Rodrigo de Salvo Braz, Jalaj Bhandari, Daniel Jiang, Yi Wan, Yonathan Efroni, Liyuan Wang, Ruiyang Xu, Hongbo Guo, Alex Nikulkov, Dmytro Korenkevych, Urun Dogan, Frank Cheng, Zheng Wu, Wanqiao Xu

Reinforcement Learning (RL) offers a versatile framework for achieving long-term goals.

reinforcement-learning Reinforcement Learning (RL)

2,385

Paper
Code

Offline Reinforcement Learning for Optimizing Production Bidding Policies

no code implementations • 13 Oct 2023 • Dmytro Korenkevych, Frank Cheng, Artsiom Balakir, Alex Nikulkov, Lingnan Gao, Zhihao Cen, Zuobing Xu, Zheqing Zhu

We use a hybrid agent architecture that combines arbitrary base policies with deep neural networks, where only the optimized base policy parameters are eventually deployed, and the neural network part is discarded after training.

reinforcement-learning

Paper
Add Code

Optimizing Long-term Value for Auction-Based Recommender Systems via On-Policy Reinforcement Learning

no code implementations • 23 May 2023 • Ruiyang Xu, Jalaj Bhandari, Dmytro Korenkevych, Fan Liu, Yuchen He, Alex Nikulkov, Zheqing Zhu

Auction-based recommender systems are prevalent in online advertising platforms, but they are typically optimized to allocate recommendation slots based on immediate expected return metrics, neglecting the downstream effects of recommendations on user behavior.

Recommendation Systems reinforcement-learning

Paper
Add Code

Evaluating Online Bandit Exploration In Large-Scale Recommender System

no code implementations • 5 Apr 2023 • Hongbo Guo, Ruben Naeff, Alex Nikulkov, Zheqing Zhu

Bandit learning has been an increasingly popular design choice for recommender system.

Fairness Recommendation Systems

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.