no code implementations • 24 Jun 2022 • James Macglashan, Evan Archer, Alisa Devlic, Takuma Seno, Craig Sherstan, Peter R. Wurman, Peter Stone
These value estimates provide insight into an agent's learning and decision-making process and enable new training methods to mitigate common problems.
no code implementations • 1 Apr 2020 • Craig Sherstan, Bilal Kartal, Pablo Hernandez-Leal, Matthew E. Taylor
Our overall conclusions are that TD-AE increases the robustness of the A2C algorithm to trajectory length, and that, while promising, further study is required to fully understand the relationship between the auxiliary task's prediction timescale and the agent's performance.
no code implementations • 18 Nov 2019 • Craig Sherstan, Shibhansh Dohare, James Macglashan, Johannes Günther, Patrick M. Pilarski
By using the timescale as one of the estimator's inputs, we can estimate value for arbitrary timescales.
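The idea in this entry, conditioning a single value estimator on the timescale, can be sketched roughly as follows. This is a minimal linear sketch under assumed features and update rules, not the paper's actual architecture; the feature construction and names are illustrative.

```python
import numpy as np

# Hedged sketch: a linear value estimator that takes the discount
# timescale gamma as an extra input, so one set of weights can produce
# value estimates for arbitrary timescales. The feature map below
# (state, gamma, and their interaction) is an illustrative assumption.

def features(state, gamma):
    """Concatenate state features, gamma, and their interaction."""
    state = np.asarray(state, dtype=float)
    return np.concatenate([state, [gamma], gamma * state])

class TimescaleValueEstimator:
    def __init__(self, n_state_features, alpha=0.1):
        self.w = np.zeros(2 * n_state_features + 1)
        self.alpha = alpha

    def value(self, state, gamma):
        """Value estimate for an arbitrary queried timescale gamma."""
        return float(features(state, gamma) @ self.w)

    def td_update(self, state, reward, next_state, gamma, done=False):
        """One TD(0) step toward r + gamma * v(s', gamma)."""
        target = reward
        if not done:
            target += gamma * self.value(next_state, gamma)
        phi = features(state, gamma)
        delta = target - phi @ self.w
        self.w += self.alpha * delta * phi
        return delta

# Repeated updates on a terminal transition with reward 1 drive the
# estimate at that (state, gamma) pair toward 1.
est = TimescaleValueEstimator(n_state_features=2)
s = [1.0, 0.0]
for _ in range(200):
    est.td_update(s, reward=1.0, next_state=s, gamma=0.9, done=True)
```

Because gamma is an input rather than a fixed constant, the same weights can be queried at timescales never used during training, which is the point of the approach described above.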
no code implementations • 23 Mar 2018 • Craig Sherstan, Marlos C. Machado, Patrick M. Pilarski
As a primary contribution of this work, we show that using successor representation (SR) based predictions can improve sample efficiency and learning speed in a continual learning setting where new predictions are incrementally added and learned over time.
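The benefit claimed in this entry can be illustrated with a tabular sketch: once the SR is learned by TD(0), values for a newly added reward prediction come from a single matrix-vector product rather than fresh TD learning over states. This tabular setup is an illustrative assumption, not the paper's experimental setting.

```python
import numpy as np

# Hedged sketch: learn the successor representation (SR) with TD(0) on a
# deterministic 3-state cycle, then reuse it for a new prediction.

def learn_sr(transitions, n_states, gamma=0.9, alpha=0.2):
    """transitions: iterable of (s, s_next) pairs.
    Returns Psi, where Psi[s] holds discounted expected future
    occupancies of each state starting from s."""
    psi = np.zeros((n_states, n_states))
    eye = np.eye(n_states)
    for s, s_next in transitions:
        # TD(0) update toward the SR target: indicator + gamma * Psi[s'].
        psi[s] += alpha * (eye[s] + gamma * psi[s_next] - psi[s])
    return psi

# Deterministic cycle 0 -> 1 -> 2 -> 0 -> ...
chain = [(t % 3, (t + 1) % 3) for t in range(6000)]
psi = learn_sr(chain, n_states=3)

# Adding a new prediction (reward of 1 on state 2) needs no further
# TD learning over states: values are just Psi @ r.
r = np.array([0.0, 0.0, 1.0])
values = psi @ r
```

For this cycle the SR has a closed form, `(I - gamma * P)^{-1}`, so the learned `psi[0]` should approach `[1, gamma, gamma**2] / (1 - gamma**3)`, and that is what makes cheap incremental predictions possible.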
no code implementations • 25 Jan 2018 • Craig Sherstan, Brendan Bennett, Kenny Young, Dylan R. Ashley, Adam White, Martha White, Richard S. Sutton
This paper investigates estimating the variance of a temporal-difference learning agent's update target.
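A common way to frame this estimation problem is through the first and second moments of the target, with the variance recovered as `Var = M - V^2`. The sketch below uses that construction on a one-step episodic example; it is a simplified stand-in, not necessarily the paper's TD-based estimator. For a general MDP the second-moment TD target would be `r^2 + 2*gamma*r*V(s') + gamma^2*M(s')`; in this terminal one-step case it reduces to `r^2`.

```python
import random

# Hedged sketch: estimate V ~ E[G] and M ~ E[G^2] for a one-state
# episodic MDP (draw a reward, then terminate) and report M - V^2.

def estimate_target_variance(sample_reward, n_episodes=20000, seed=0):
    rng = random.Random(seed)
    v = m = 0.0
    for t in range(1, n_episodes + 1):
        r = sample_reward(rng)
        alpha = 1.0 / t                 # decaying step size -> sample averages
        v += alpha * (r - v)            # target for V is r
        m += alpha * (r * r - m)        # target for M is r * r
    return m - v * v

# Reward is 0 or 2 with equal probability: E[G] = 1, Var[G] = 1.
var = estimate_target_variance(lambda rng: rng.choice([0.0, 2.0]))
```

Knowing the variance of the update target, not just its mean, is what enables the uses suggested in the entry, such as diagnosing noisy states or adapting step sizes.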
no code implementations • 10 Nov 2017 • Patrick M. Pilarski, Richard S. Sutton, Kory W. Mathewson, Craig Sherstan, Adam S. R. Parker, Ann L. Edwards
This work presents an overarching perspective on the role that machine intelligence can play in enhancing human abilities, especially those that have been diminished due to injury or illness.
no code implementations • 17 Jun 2016 • Craig Sherstan, Adam White, Marlos C. Machado, Patrick M. Pilarski
Agents of general intelligence deployed in real-world scenarios must adapt to ever-changing environmental conditions.