Search Results for author: Mahammad Humayoo

Found 2 papers, 0 papers with code

Parameter Estimation with the Ordered $\ell_{2}$ Regularization via an Alternating Direction Method of Multipliers

no code implementations4 Sep 2019 Mahammad Humayoo, Xue-Qi Cheng

The reason stems from the fact that the ordered regularization can reject irrelevant variables and yield an accurate estimation of the parameters.

Relative Importance Sampling For Off-Policy Actor-Critic in Deep Reinforcement Learning

no code implementations30 Oct 2018 Mahammad Humayoo, Xue-Qi Cheng

One reason for the instability of off-policy learning is a discrepancy between the target ($\pi$) and behavior (b) policy distributions.

reinforcement-learning Reinforcement Learning (RL)

Cannot find the paper you are looking for? You can Submit a new open access paper.