Search Results for author: Thomas M. Moerland

Found 19 papers, 11 papers with code

Explicitly Disentangled Representations in Object-Centric Learning

1 code implementation18 Jan 2024 Riccardo Majellaro, Jonathan Collu, Aske Plaat, Thomas M. Moerland

Extracting structured representations from raw visual data is an important and long-standing challenge in machine learning.

Disentanglement Object

EduGym: An Environment and Notebook Suite for Reinforcement Learning Education

1 code implementation17 Nov 2023 Thomas M. Moerland, Matthias Müller-Brockhausen, Zhao Yang, Andrius Bernatavicius, Koen Ponse, Tom Kouwenhoven, Andreas Sauter, Michiel van der Meer, Bram Renting, Aske Plaat

To solve this issue we introduce EduGym, a set of educational reinforcement learning environments and associated interactive notebooks tailored for education.

reinforcement-learning

What model does MuZero learn?

no code implementations1 Jun 2023 Jinke He, Thomas M. Moerland, Frans A. Oliehoek

Model-based reinforcement learning has drawn considerable interest in recent years, given its promise to improve sample efficiency.

Model-based Reinforcement Learning reinforcement-learning

First Go, then Post-Explore: the Benefits of Post-Exploration in Intrinsic Motivation

no code implementations6 Dec 2022 Zhao Yang, Thomas M. Moerland, Mike Preuss, Aske Plaat

In this paper, we present a clear ablation study of post-exploration in a general intrinsically motivated goal exploration process (IMGEP) framework, that the Go-Explore paper did not show.

Continuous Control Reinforcement Learning (RL)

Continuous Episodic Control

no code implementations28 Nov 2022 Zhao Yang, Thomas M. Moerland, Mike Preuss, Aske Plaat

Therefore, this paper introduces Continuous Episodic Control (CEC), a novel non-parametric episodic memory algorithm for sequential decision making in problems with a continuous action space.

Continuous Control Decision Making +2

When to Go, and When to Explore: The Benefit of Post-Exploration in Intrinsic Motivation

no code implementations29 Mar 2022 Zhao Yang, Thomas M. Moerland, Mike Preuss, Aske Plaat

Go-Explore achieved breakthrough performance on challenging reinforcement learning (RL) tasks with sparse rewards.

Reinforcement Learning (RL)

On Credit Assignment in Hierarchical Reinforcement Learning

1 code implementation7 Mar 2022 Joery A. de Vries, Thomas M. Moerland, Aske Plaat

To improve our fundamental understanding of HRL, we investigate hierarchical credit assignment from the perspective of conventional multistep reinforcement learning.

Hierarchical Reinforcement Learning reinforcement-learning +1

Visualizing MuZero Models

1 code implementation ICML Workshop URL 2021 Joery A. de Vries, Ken S. Voskuil, Thomas M. Moerland, Aske Plaat

In contrast to standard forward dynamics models that predict a full next state, value equivalent models are trained to predict a future value, thereby emphasizing value relevant information in the representations.

Game of Go Model-based Reinforcement Learning

A Unifying Framework for Reinforcement Learning and Planning

no code implementations26 Jun 2020 Thomas M. Moerland, Joost Broekens, Aske Plaat, Catholijn M. Jonker

Therefore, this paper presents a unifying algorithmic framework for reinforcement learning and planning (FRAP), which identifies underlying dimensions on which MDP planning and learning algorithms have to decide.

Decision Making reinforcement-learning +1

The Second Type of Uncertainty in Monte Carlo Tree Search

1 code implementation19 May 2020 Thomas M. Moerland, Joost Broekens, Aske Plaat, Catholijn M. Jonker

Monte Carlo Tree Search (MCTS) efficiently balances exploration and exploitation in tree search based on count-derived uncertainty.

Vocal Bursts Type Prediction

The Potential of the Return Distribution for Exploration in RL

1 code implementation11 Jun 2018 Thomas M. Moerland, Joost Broekens, Catholijn M. Jonker

This paper studies the potential of the return distribution for exploration in deterministic reinforcement learning (RL) environments.

reinforcement-learning Reinforcement Learning (RL)

A0C: Alpha Zero in Continuous Action Space

2 code implementations24 May 2018 Thomas M. Moerland, Joost Broekens, Aske Plaat, Catholijn M. Jonker

A core novelty of Alpha Zero is the interleaving of tree search and deep learning, which has proven very successful in board games like Chess, Shogi and Go.

Board Games reinforcement-learning +2

Monte Carlo Tree Search for Asymmetric Trees

2 code implementations23 May 2018 Thomas M. Moerland, Joost Broekens, Aske Plaat, Catholijn M. Jonker

Asymmetric termination of search trees introduces a type of uncertainty for which the standard upper confidence bound (UCB) formula does not account.

Atari Games OpenAI Gym

Efficient exploration with Double Uncertain Value Networks

no code implementations29 Nov 2017 Thomas M. Moerland, Joost Broekens, Catholijn M. Jonker

This paper studies directed exploration for reinforcement learning agents by tracking uncertainty about the value of each available action.

Efficient Exploration Thompson Sampling

Emotion in Reinforcement Learning Agents and Robots: A Survey

no code implementations15 May 2017 Thomas M. Moerland, Joost Broekens, Catholijn M. Jonker

This article provides the first survey of computational models of emotion in reinforcement learning (RL) agents.

Decision Making reinforcement-learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.