Search Results for author: Joel Veness

Found 28 papers, 14 papers with code

A Monte Carlo AIXI Approximation

2 code implementations • 4 Sep 2009 • Joel Veness, Kee Siong Ng, Marcus Hutter, William Uther, David Silver

This paper introduces a principled approach for the design of a scalable general reinforcement learning agent.

General Reinforcement Learning • Open-Ended Question Answering • +2

Bootstrapping from Game Tree Search

no code implementations • NeurIPS 2009 • Joel Veness, David Silver, Alan Blair, William Uther

We implemented our algorithm in a chess program Meep, using a linear heuristic function.

Monte-Carlo Planning in Large POMDPs

no code implementations • NeurIPS 2010 • David Silver, Joel Veness

Our Monte-Carlo planning algorithm achieved a high level of performance with no prior knowledge, and was also able to exploit simple domain knowledge to achieve better results with less search.

Context Tree Switching

1 code implementation • 14 Nov 2011 • Joel Veness, Kee Siong Ng, Marcus Hutter, Michael Bowling

This paper describes the Context Tree Switching technique, a modification of Context Tree Weighting for the prediction of binary, stationary, n-Markov sources.

Information Theory
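Context Tree Weighting, which CTS modifies, uses the Krichevsky-Trofimov (KT) estimator as the base predictor at each context node. A minimal sketch of that estimator (an illustration of the standard KT rule, not code from the paper):

```python
def kt_prob_one(zeros: int, ones: int) -> float:
    """KT (Krichevsky-Trofimov) estimate of P(next bit = 1)
    after observing `zeros` 0s and `ones` 1s in a given context."""
    return (ones + 0.5) / (zeros + ones + 1.0)

def kt_sequence_prob(bits) -> float:
    """Sequential KT probability assigned to a whole binary string."""
    p, zeros, ones = 1.0, 0, 0
    for b in bits:
        p1 = kt_prob_one(zeros, ones)
        p *= p1 if b == 1 else (1.0 - p1)
        if b == 1:
            ones += 1
        else:
            zeros += 1
    return p
```

CTW mixes such per-context estimators over all tree depths; CTS replaces that fixed mixture with a switching mixture that can change model class over time.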

Variance Reduction in Monte-Carlo Tree Search

no code implementations • NeurIPS 2011 • Joel Veness, Marc Lanctot, Michael Bowling

Monte-Carlo Tree Search (MCTS) has proven to be a powerful, generic planning technique for decision-making in single-agent and adversarial environments.

Decision Making
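As a rough illustration of the tree policy used by MCTS variants such as UCT, here is a minimal UCB1 selection loop on a toy one-step problem (a hedged sketch for intuition only; it is not the paper's algorithm and shows none of its variance-reduction techniques):

```python
import math
import random

class Node:
    """A single MCTS node tracking per-action visit counts and value sums."""
    def __init__(self, actions):
        self.n = 0                                        # total visits
        self.children = {a: [0, 0.0] for a in actions}    # action -> [count, value sum]

    def select(self, c=1.414):
        """UCB1 tree policy: mean value plus an exploration bonus."""
        for a, (n_a, _) in self.children.items():
            if n_a == 0:
                return a                                  # try untried actions first
        return max(
            self.children,
            key=lambda a: (self.children[a][1] / self.children[a][0]
                           + c * math.sqrt(math.log(self.n) / self.children[a][0])),
        )

    def update(self, a, reward):
        self.n += 1
        self.children[a][0] += 1
        self.children[a][1] += reward

# Toy demo: action 1 pays 0.3 more on average than action 0.
random.seed(0)
root = Node(actions=[0, 1])
for _ in range(5000):
    a = root.select()
    reward = random.random() + (0.3 if a == 1 else 0.0)   # simulated rollout return
    root.update(a, reward)

best_action = max(root.children, key=lambda a: root.children[a][0])
```

In a real MCTS the rollout line would descend the tree and simulate to a terminal state; here the single node makes the selection rule concentrate visits on the better action.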

The Arcade Learning Environment: An Evaluation Platform for General Agents

3 code implementations • 19 Jul 2012 • Marc G. Bellemare, Yavar Naddaf, Joel Veness, Michael Bowling

We illustrate the promise of ALE by developing and benchmarking domain-independent agents designed using well-established AI techniques for both reinforcement learning and planning.

Atari Games • Benchmarking • +4

Sketch-Based Linear Value Function Approximation

no code implementations • NeurIPS 2012 • Marc Bellemare, Joel Veness, Michael Bowling

Unfortunately, the typical use of hashing in value function approximation results in biased value estimates due to the possibility of collisions.

Atari Games • reinforcement-learning • +1
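The collision bias mentioned above can be reproduced in a few lines; the toy hash function and states below are invented for illustration and are not the paper's sketch-based construction:

```python
# Hashed linear value features: each state maps to one index in a small table.
TABLE_SIZE = 8

def feature_index(state: str) -> int:
    # Python's built-in hash() is salted per process, so use a fixed toy hash.
    return sum(ord(c) for c in state) % TABLE_SIZE

weights = [0.0] * TABLE_SIZE

def td_update(state: str, target: float, alpha: float = 0.5) -> None:
    i = feature_index(state)
    weights[i] += alpha * (target - weights[i])

def value(state: str) -> float:
    return weights[feature_index(state)]

# "ab" and "ba" have the same character sum, so they collide: training on
# one state perturbs the other's estimate, and both settle on a biased
# value between their true values of 1.0 and 0.0.
for _ in range(20):
    td_update("ab", target=1.0)
    td_update("ba", target=0.0)
```

With these alternating targets both estimates converge to about 1/3, matching neither true value, which is the bias the paper's sketch-based approach is designed to remove.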

Online Learning of k-CNF Boolean Functions

no code implementations • 26 Mar 2014 • Joel Veness, Marcus Hutter

This paper revisits the problem of learning a k-CNF Boolean function from examples in the context of online learning under the logarithmic loss.

PAC learning

Compress and Control

no code implementations • 19 Nov 2014 • Joel Veness, Marc G. Bellemare, Marcus Hutter, Alvin Chua, Guillaume Desjardins

This paper describes a new information-theoretic policy evaluation technique for reinforcement learning.

Reinforcement Learning (RL)

Human-level control through deep reinforcement learning

7 code implementations • 25 Feb 2015 • Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A. Rusu, Joel Veness, Marc G. Bellemare, Alex Graves, Martin Riedmiller, Andreas K. Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik, Ioannis Antonoglou, Helen King, Dharshan Kumaran, Daan Wierstra, Shane Legg & Demis Hassabis

We demonstrate that the deep Q-network agent, receiving only the pixels and the game score as inputs, was able to surpass the performance of all previous algorithms and achieve a level comparable to that of a professional human games tester across a set of 49 games, using the same algorithm, network architecture and hyperparameters.

Atari Games • reinforcement-learning • +1

The Forget-me-not Process

no code implementations • NeurIPS 2016 • Kieran Milan, Joel Veness, James Kirkpatrick, Michael Bowling, Anna Koop, Demis Hassabis

We introduce the Forget-me-not Process, an efficient, non-parametric meta-algorithm for online probabilistic sequence prediction for piecewise stationary, repeating sources.

Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents

7 code implementations • 18 Sep 2017 • Marlos C. Machado, Marc G. Bellemare, Erik Talvitie, Joel Veness, Matthew Hausknecht, Michael Bowling

The Arcade Learning Environment (ALE) is an evaluation platform that poses the challenge of building AI agents with general competency across dozens of Atari 2600 games.

Atari Games

Online Learning with Gated Linear Networks

no code implementations • 5 Dec 2017 • Joel Veness, Tor Lattimore, Avishkar Bhoopchand, Agnieszka Grabska-Barwinska, Christopher Mattern, Peter Toth

This paper describes a family of probabilistic architectures designed for online learning under the logarithmic loss.
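In the GLN family, each neuron outputs a geometric mixture of its input probabilities — a sigmoid of a weighted sum of their logits — with weights adapted by online gradient descent under log loss. A minimal ungated sketch of that mixing rule (the paper's full architecture additionally gates each neuron's weights on side information, which this toy omits):

```python
import math
import random

def logit(p: float) -> float:
    return math.log(p / (1.0 - p))

def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))

class GeoMixNeuron:
    """Geometric mixture of expert probabilities: sigmoid of a weighted
    sum of logits, trained online by gradient descent on log loss."""
    def __init__(self, n: int, lr: float = 0.1):
        self.w = [1.0 / n] * n
        self.lr = lr

    def predict(self, probs) -> float:
        return sigmoid(sum(wi * logit(pi) for wi, pi in zip(self.w, probs)))

    def update(self, probs, target: int) -> float:
        p = self.predict(probs)
        # Gradient of log loss w.r.t. w_i is (p - target) * logit(p_i).
        for i, pi in enumerate(probs):
            self.w[i] -= self.lr * (p - target) * logit(pi)
        return p

# Expert 0 is informative, expert 1 is noise: online updates under log
# loss should upweight expert 0 relative to expert 1.
random.seed(1)
neuron = GeoMixNeuron(2)
for _ in range(500):
    y = random.randint(0, 1)
    probs = [0.9 if y == 1 else 0.1,     # expert 0: accurate
             random.uniform(0.2, 0.8)]   # expert 1: uninformative
    neuron.update(probs, y)
```

A convenient property of this rule, emphasized in the GLN line of work, is that each neuron's loss is convex in its own weights, so every unit can learn online without backpropagation.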

Online Learning in Contextual Bandits using Gated Linear Networks

no code implementations • NeurIPS 2020 • Eren Sezener, Marcus Hutter, David Budden, Jianan Wang, Joel Veness

We introduce a new and completely online contextual bandit algorithm called Gated Linear Contextual Bandits (GLCB).

Multi-Armed Bandits

Gaussian Gated Linear Networks

2 code implementations • NeurIPS 2020 • David Budden, Adam Marblestone, Eren Sezener, Tor Lattimore, Greg Wayne, Joel Veness

We propose the Gaussian Gated Linear Network (G-GLN), an extension to the recently proposed GLN family of deep neural networks.

Denoising • Density Estimation • +2

A Combinatorial Perspective on Transfer Learning

1 code implementation • NeurIPS 2020 • Jianan Wang, Eren Sezener, David Budden, Marcus Hutter, Joel Veness

Our main postulate is that the combination of task segmentation, modular learning and memory-based ensembling can give rise to generalization on an exponentially growing number of unseen tasks.

Continual Learning • Transfer Learning

Reinforcement Learning with Information-Theoretic Actuation

no code implementations • 30 Sep 2021 • Elliot Catt, Marcus Hutter, Joel Veness

In this work we explore and formalize a contrasting view, namely that actions are best thought of as the output of a sequence of internal choices with respect to an action model.

reinforcement-learning • Reinforcement Learning (RL)

Beyond Bayes-optimality: meta-learning what you know you don't know

no code implementations • 30 Sep 2022 • Jordi Grau-Moya, Grégoire Delétang, Markus Kunesch, Tim Genewein, Elliot Catt, Kevin Li, Anian Ruoss, Chris Cundy, Joel Veness, Jane Wang, Marcus Hutter, Christopher Summerfield, Shane Legg, Pedro Ortega

This is in contrast to risk-sensitive agents, which additionally exploit the higher-order moments of the return, and ambiguity-sensitive agents, which act differently when recognizing situations in which they lack knowledge.

Decision Making • Meta-Learning

Language Modeling Is Compression

1 code implementation • 19 Sep 2023 • Grégoire Delétang, Anian Ruoss, Paul-Ambroise Duquenne, Elliot Catt, Tim Genewein, Christopher Mattern, Jordi Grau-Moya, Li Kevin Wenliang, Matthew Aitchison, Laurent Orseau, Marcus Hutter, Joel Veness

We show that large language models are powerful general-purpose predictors and that the compression viewpoint provides novel insights into scaling laws, tokenization, and in-context learning.

In-Context Learning • Language Modelling
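The prediction-compression correspondence this paper exploits: an arithmetic coder driven by any sequential predictor compresses a sequence to roughly the sum of -log2 p(x_t | x_<t) bits. A toy sketch with a Laplace-smoothed unigram predictor standing in for the language model (an illustration of the general principle, not the paper's setup):

```python
import math

def code_length_bits(seq, predict) -> float:
    """Ideal compressed size of `seq` under `predict`: an arithmetic
    coder achieves sum_t -log2 p(x_t | x_<t) bits, up to O(1) overhead."""
    total = 0.0
    for t, x in enumerate(seq):
        p = predict(seq[:t])[x]
        total += -math.log2(p)
    return total

def laplace_predict(context):
    """Laplace-smoothed unigram model over the binary alphabet {0, 1}."""
    ones = sum(context)
    p1 = (ones + 1) / (len(context) + 2)
    return {0: 1.0 - p1, 1: p1}

skewed = [1] * 90 + [0] * 10     # low-entropy source: compresses well
balanced = [1, 0] * 50           # high-entropy source: near 1 bit/symbol
```

Swapping `laplace_predict` for a language model's conditional probabilities gives exactly the lossless-compression view of the paper: a better predictor means fewer bits.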

Learning Universal Predictors

1 code implementation • 26 Jan 2024 • Jordi Grau-Moya, Tim Genewein, Marcus Hutter, Laurent Orseau, Grégoire Delétang, Elliot Catt, Anian Ruoss, Li Kevin Wenliang, Christopher Mattern, Matthew Aitchison, Joel Veness

Meta-learning has emerged as a powerful approach to train neural networks to learn new tasks quickly from limited data.

Meta-Learning
