Search Results for author: Kenny Young

Found 12 papers, 2 papers with code

Iterative Option Discovery for Planning, by Planning

no code implementations2 Oct 2023 Kenny Young, Richard S. Sutton

Discovering useful temporal abstractions, in the form of options, is widely thought to be key to applying reinforcement learning and planning to increasingly complex domains.

The Benefits of Model-Based Generalization in Reinforcement Learning

1 code implementation4 Nov 2022 Kenny Young, Aditya Ramesh, Louis Kirsch, Jürgen Schmidhuber

First, we provide a simple theorem motivating how learning a model as an intermediate step can narrow down the set of possible value functions more than learning a value function directly from data using the Bellman equation.

Model-based Reinforcement Learning reinforcement-learning +1

Doubly-Asynchronous Value Iteration: Making Value Iteration Asynchronous in Actions

no code implementations4 Jul 2022 Tian Tian, Kenny Young, Richard S. Sutton

However, Asynchronous VI still requires a maximization over the entire action space, making it impractical for domains with large action space.

Hindsight Network Credit Assignment: Efficient Credit Assignment in Networks of Discrete Stochastic Units

no code implementations14 Oct 2021 Kenny Young

We then show how HNCA can be extended to optimize a more general function of the outputs of a network of stochastic units, where the function is known to the agent.

Hindsight Network Credit Assignment

no code implementations24 Nov 2020 Kenny Young

We present Hindsight Network Credit Assignment (HNCA), a novel learning method for stochastic neural networks, which works by assigning credit to each neuron's stochastic output based on how it influences the output of its immediate children in the network.

Understanding the Pathologies of Approximate Policy Evaluation when Combined with Greedification in Reinforcement Learning

no code implementations28 Oct 2020 Kenny Young, Richard S. Sutton

We demonstrate analytically and experimentally that such pathological behaviours can impact a wide range of RL and dynamic programming algorithms; such behaviours can arise both with and without bootstrapping, and with linear function approximation as well as with more complex parameterized functions like neural networks.

Reinforcement Learning (RL)

Variance Reduced Advantage Estimation with $δ$ Hindsight Credit Assignment

no code implementations19 Nov 2019 Kenny Young

Hindsight Credit Assignment (HCA) refers to a recently proposed family of methods for producing more efficient credit assignment in reinforcement learning.

reinforcement-learning Reinforcement Learning (RL)

MinAtar: An Atari-Inspired Testbed for Thorough and Reproducible Reinforcement Learning Experiments

3 code implementations7 Mar 2019 Kenny Young, Tian Tian

With the representation learning problem simplified, we can perform experiments with significantly less computational expense.

Atari Games reinforcement-learning +2

Metatrace Actor-Critic: Online Step-size Tuning by Meta-gradient Descent for Reinforcement Learning Control

no code implementations10 May 2018 Kenny Young, Baoxiang Wang, Matthew E. Taylor

Finally, we apply Metatrace for control with nonlinear function approximation in 5 games in the Arcade Learning Environment where we explore how it impacts learning speed and robustness to initial step-size choice.

Atari Games Meta-Learning +1

A Reverse Hex Solver

no code implementations26 Apr 2017 Kenny Young, Ryan B. Hayward

We present Solrex, an automated solver for the game of Reverse Hex. Reverse Hex, also known as Rex, or Misere Hex, is the variant of the game of Hex in which the player who joins her two sides loses the game.

Position

Neurohex: A Deep Q-learning Hex Agent

no code implementations24 Apr 2016 Kenny Young, Ryan Hayward, Gautham Vasan

DeepMind's recent spectacular success in using deep convolutional neural nets and machine learning to build superhuman level agents --- e. g. for Atari games via deep Q-learning and for the game of Go via Reinforcement Learning --- raises many questions, including to what extent these methods will succeed in other domains.

Atari Games Game of Go +1

Cannot find the paper you are looking for? You can Submit a new open access paper.