Search Results for author: Sheng Han

Found 3 papers, 0 papers with code

Conservative Distributional Reinforcement Learning with Safety Constraints

no code implementations · 18 Jan 2022 · Hengrui Zhang, Youfang Lin, Sheng Han, Shuo Wang, Kai Lv

CDMPO uses a conservative value function loss to reduce the number of constraint violations during exploration.

Distributional Reinforcement Learning, reinforcement-learning

$l_1$-regularized Outlier Isolation and Regression

no code implementations · 1 Jun 2014 · Sheng Han, Suzhen Wang, Xinyu Wu

This paper proposes a new regression model, $l_1$-regularized outlier isolation and regression (LOIRE), and a fast algorithm based on block coordinate descent to solve it.

regression
