Search Results for author: Longxiang Shi

Found 2 papers, 0 papers with code

FiDi-RL: Incorporating Deep Reinforcement Learning with Finite-Difference Policy Search for Efficient Learning of Continuous Control

no code implementations • 1 Jul 2019 • Longxiang Shi, Shijian Li, Longbing Cao, Long Yang, Gang Zheng, Gang Pan

Alternatively, derivative-based methods treat the optimization process as a blackbox and show robustness and stability in learning continuous control tasks, but not data efficient in learning.

Continuous Control reinforcement-learning +1

Paper
Add Code

TBQ($σ$): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning

no code implementations • 17 May 2019 • Longxiang Shi, Shijian Li, Longbing Cao, Long Yang, Gang Pan

However, existing off-policy learning methods based on probabilistic policy measurement are inefficient when utilizing traces under a greedy target policy, which is ineffective for control problems.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.