Search Results for author: Minhao Shi

Found 1 paper, 0 papers with code

A Unified Approach for Multi-step Temporal-Difference Learning with Eligibility Traces in Reinforcement Learning

no code implementations • 9 Feb 2018 • Long Yang, Minhao Shi, Qian Zheng, Wenjia Meng, Gang Pan

Results show that, with an intermediate value of $\sigma$, $Q(\sigma, \lambda)$ creates a mixture of the existing algorithms that can learn the optimal value significantly faster than either extreme ($\sigma = 0$ or $\sigma = 1$).
