13 May 2014 • Manel Tagorti, Bruno Scherrer
We consider LSTD($\lambda$), the least-squares temporal-difference algorithm with eligibility traces proposed by Boyan (2002).
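For readers unfamiliar with the algorithm, the following is a minimal sketch of the standard batch form of LSTD($\lambda$) with linear features, written from the textbook description rather than taken from the paper. The function name `lstd_lambda`, its argument layout, and the two-state toy chain are assumptions made for illustration only.

```python
import numpy as np


def lstd_lambda(features, rewards, gamma, lam):
    """Batch LSTD(lambda) on one trajectory (illustrative sketch, hypothetical API).

    features: array of shape (T + 1, d) holding phi(s_0), ..., phi(s_T)
    rewards:  array of shape (T,), the reward observed on each transition
    gamma:    discount factor
    lam:      eligibility-trace parameter lambda
    """
    features = np.asarray(features, dtype=float)
    rewards = np.asarray(rewards, dtype=float)
    d = features.shape[1]
    A = np.zeros((d, d))
    b = np.zeros(d)
    z = np.zeros(d)  # eligibility trace
    for t in range(len(rewards)):
        phi, phi_next = features[t], features[t + 1]
        z = lam * gamma * z + phi                  # decay the trace, add current features
        A += np.outer(z, phi - gamma * phi_next)   # accumulate the linear system
        b += z * rewards[t]
    # Solve A theta = b; assumes A is invertible for the given data.
    return np.linalg.solve(A, b)


# Toy example (assumed): a deterministic chain 0 -> 1 -> 0 -> ... with
# one-hot features and reward 1 when leaving state 0, 0 when leaving state 1.
states = [t % 2 for t in range(21)]
features = np.eye(2)[states]
rewards = [1.0 if s == 0 else 0.0 for s in states[:-1]]
theta = lstd_lambda(features, rewards, gamma=0.5, lam=0.5)
```

With one-hot features and deterministic transitions, the true values $V(0) = 4/3$ and $V(1) = 2/3$ satisfy every observed Bellman equation exactly, so the solve should recover them up to floating-point error. With noisy transitions or non-tabular features the estimate would only approximate the projected fixed point.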