Search Results for author: Yusen Huo

Found 3 papers, 1 paper with code

Trajectory-wise Iterative Reinforcement Learning Framework for Auto-bidding

no code implementations • 23 Feb 2024 • Haoming Li, Yusen Huo, Shuai Dou, Zhenzhe Zheng, Zhilin Zhang, Chuan Yu, Jian Xu, Fan Wu

The trained policy can subsequently be deployed for further data collection, resulting in an iterative training framework, which we refer to as iterative offline RL.

Offline RL reinforcement-learning +2


Sustainable Online Reinforcement Learning for Auto-bidding

1 code implementation • 13 Oct 2022 • Zhiyu Mou, Yusen Huo, Rongquan Bai, Mingzhou Xie, Chuan Yu, Jian Xu, Bo Zheng

Due to safety concerns, it was believed that the RL training process can only be carried out in an offline virtual advertising system (VAS) that is built based on the historical data generated in the RAS.

Q-Learning reinforcement-learning +1


Tensor-based Cooperative Control for Large Scale Multi-intersection Traffic Signal Using Deep Reinforcement Learning and Imitation Learning

no code implementations • 30 Sep 2019 • Yusen Huo, Qinghua Tao, Jianming Hu

In the proposed model, a multi-task learning structure is used to get the cooperative policy by learning.

Imitation Learning Multi-Task Learning +2

