Search Results for author: Lex Weaver

Found 2 papers, 0 papers with code

The Optimal Reward Baseline for Gradient-Based Reinforcement Learning

no code implementations • 10 Jan 2013 • Lex Weaver, Nigel Tao

There exist a number of reinforcement learning algorithms which learn by climbing the gradient of expected reward.

KnightCap: A chess program that learns by combining TD(lambda) with game-tree search

no code implementations • 10 Jan 1999 • Jonathan Baxter, Andrew Tridgell, Lex Weaver

In this paper we present TDLeaf(lambda), a variation on the TD(lambda) algorithm that enables it to be used in conjunction with game-tree search.
