Search Results for author: Hado P. Van Hasselt

Found 2 papers, 0 papers with code

Natural Value Approximators: Learning when to Trust Past Estimates

no code implementations • NeurIPS 2017 • Zhongwen Xu, Joseph Modayil, Hado P. Van Hasselt, Andre Barreto, David Silver, Tom Schaul

Neural networks have a smooth initial inductive bias, such that small changes in input do not lead to large changes in output.

Atari Games Inductive Bias +2

Paper
Add Code

Weighted importance sampling for off-policy learning with linear function approximation

no code implementations • NeurIPS 2014 • A. Rupam Mahmood, Hado P. Van Hasselt, Richard S. Sutton

Second, we show that these benefits extend to a new weighted-importance-sampling version of off-policy LSTD(lambda).

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.