Search Results for author: Brendan Bennett

Found 4 papers, 1 papers with code

Back to Square One: Superhuman Performance in Chutes and Ladders Through Deep Neural Networks and Tree Search

1 code implementation1 Apr 2021 Dylan Ashley, Anssi Kanervisto, Brendan Bennett

We present AlphaChute: a state-of-the-art algorithm that achieves superhuman performance in the ancient game of Chutes and Ladders.

Incrementally Learning Functions of the Return

no code implementations5 Jul 2019 Brendan Bennett, Wesley Chung, Muhammad Zaheer, Vincent Liu

Temporal difference methods enable efficient estimation of value functions in reinforcement learning in an incremental fashion, and are of broader interest because they correspond learning as observed in biological systems.

Reinforcement Learning (RL)

Predicting Periodicity with Temporal Difference Learning

no code implementations20 Sep 2018 Kristopher De Asis, Brendan Bennett, Richard S. Sutton

Temporal difference (TD) learning is an important approach in reinforcement learning, as it combines ideas from dynamic programming and Monte Carlo methods in a way that allows for online and incremental model-free learning.

Decision Making

Cannot find the paper you are looking for? You can Submit a new open access paper.