Search Results for author: Yusen Huo

Found 3 papers, 1 papers with code

Trajectory-wise Iterative Reinforcement Learning Framework for Auto-bidding

no code implementations23 Feb 2024 Haoming Li, Yusen Huo, Shuai Dou, Zhenzhe Zheng, Zhilin Zhang, Chuan Yu, Jian Xu, Fan Wu

The trained policy can subsequently be deployed for further data collection, resulting in an iterative training framework, which we refer to as iterative offline RL.

Offline RL reinforcement-learning +2

Sustainable Online Reinforcement Learning for Auto-bidding

1 code implementation13 Oct 2022 Zhiyu Mou, Yusen Huo, Rongquan Bai, Mingzhou Xie, Chuan Yu, Jian Xu, Bo Zheng

Due to safety concerns, it was believed that the RL training process can only be carried out in an offline virtual advertising system (VAS) that is built based on the historical data generated in the RAS.

Q-Learning reinforcement-learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.