Search Results for author: Honglei Yin

Found 3 papers, 1 papers with code

Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations

2 code implementations20 Jul 2022 Haoran Xu, Xianyuan Zhan, Honglei Yin, Huiling Qin

We study the problem of offline Imitation Learning (IL) where an agent aims to learn an optimal expert behavior policy without additional online environment interactions.

Imitation Learning Offline RL +1

Offline Reinforcement Learning with Soft Behavior Regularization

no code implementations14 Oct 2021 Haoran Xu, Xianyuan Zhan, Jianxiong Li, Honglei Yin

In this work, we start from the performance difference between the learned policy and the behavior policy, we derive a new policy learning objective that can be used in the offline setting, which corresponds to the advantage function value of the behavior policy, multiplying by a state-marginal density ratio.

Continuous Control reinforcement-learning +1

DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning

no code implementations23 Feb 2021 Xianyuan Zhan, Haoran Xu, Yue Zhang, Xiangyu Zhu, Honglei Yin, Yu Zheng

Optimizing the combustion efficiency of a thermal power generating unit (TPGU) is a highly challenging and critical task in the energy industry.

Continuous Control Offline RL +2

Cannot find the paper you are looking for? You can Submit a new open access paper.