Search Results for author: Minhao Shi

Found 1 paper, 0 papers with code

A Unified Approach for Multi-step Temporal-Difference Learning with Eligibility Traces in Reinforcement Learning

No code implementations · 9 Feb 2018 · Long Yang, Minhao Shi, Qian Zheng, Wenjia Meng, Gang Pan

Results show that, with an intermediate value of $\sigma$, $Q(\sigma, \lambda)$ creates a mixture of the existing algorithms that can learn the optimal value significantly faster than either extreme ($\sigma = 0$ or $\sigma = 1$).
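
To illustrate how $\sigma$ can blend two existing algorithms, here is a minimal sketch of the standard one-step $Q(\sigma)$ target, which weights a sampled (Sarsa-style) backup against an expected (Expected Sarsa / tree-backup-style) backup. It is not the authors' implementation, and it leaves out the eligibility-trace ($\lambda$) part of $Q(\sigma, \lambda)$ entirely. The function name and all argument names are hypothetical, chosen only for this example.

```python
def q_sigma_target(reward, gamma, sigma, q_next, pi_next, a_next):
    """One-step sigma-weighted TD target (illustrative sketch only).

    reward  : immediate reward
    gamma   : discount factor
    sigma   : mixing weight in [0, 1]; 1 = full sampling, 0 = full expectation
    q_next  : action values Q(s', a) for every action a in the next state
    pi_next : target-policy probabilities pi(a | s') for the same actions
    a_next  : index of the action actually sampled in the next state
    """
    # Sampled backup: value of the action that was actually taken.
    sampled = q_next[a_next]
    # Expected backup: policy-weighted average over all next actions.
    expected = sum(p * q for p, q in zip(pi_next, q_next))
    # Intermediate sigma interpolates linearly between the two backups.
    return reward + gamma * (sigma * sampled + (1.0 - sigma) * expected)


if __name__ == "__main__":
    q_next = [1.0, 3.0]
    pi_next = [0.5, 0.5]
    for sigma in (0.0, 0.5, 1.0):
        target = q_sigma_target(1.0, 0.5, sigma, q_next, pi_next, a_next=1)
        print(sigma, target)
```

With $\sigma = 1$ the target uses only the sampled action value, with $\sigma = 0$ it uses only the expectation under the policy, and intermediate values mix the two.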
