Search Results for author: Marco A. Wiering

Found 7 papers, 4 papers with code

Multi-Source Transfer Learning for Deep Model-Based Reinforcement Learning

no code implementations28 May 2022 Remo Sasso, Matthia Sabatelli, Marco A. Wiering

A crucial challenge in reinforcement learning is to reduce the number of interactions with the environment that an agent requires to master a given task.

Continuous Control Model-based Reinforcement Learning +3

Fractional Transfer Learning for Deep Model-Based Reinforcement Learning

no code implementations14 Aug 2021 Remo Sasso, Matthia Sabatelli, Marco A. Wiering

Reinforcement learning (RL) is well known for requiring large amounts of data in order for RL agents to learn to perform complex tasks.

Model-based Reinforcement Learning reinforcement-learning +2

Enhancing reinforcement learning by a finite reward response filter with a case study in intelligent structural control

no code implementations25 Oct 2020 Hamid Radmard Rahmani, Carsten Koenke, Marco A. Wiering

In many reinforcement learning (RL) problems, it takes some time until a taken action by the agent reaches its maximum effect on the environment and consequently the agent receives the reward corresponding to that action by a delay called action-effect delay.

Q-Learning reinforcement-learning +1

Continuous-action Reinforcement Learning for Playing Racing Games: Comparing SPG to PPO

1 code implementation15 Jan 2020 Mario S. Holubar, Marco A. Wiering

Different versions of two actor-critic learning algorithms are tested on this environment: Sampled Policy Gradient (SPG) and Proximal Policy Optimization (PPO).

OpenAI Gym reinforcement-learning +1

Approximating two value functions instead of one: towards characterizing a new family of Deep Reinforcement Learning algorithms

3 code implementations1 Sep 2019 Matthia Sabatelli, Gilles Louppe, Pierre Geurts, Marco A. Wiering

This paper makes one step forward towards characterizing a new family of \textit{model-free} Deep Reinforcement Learning (DRL) algorithms.

Q-Learning

Deep Quality-Value (DQV) Learning

3 code implementations30 Sep 2018 Matthia Sabatelli, Gilles Louppe, Pierre Geurts, Marco A. Wiering

We introduce a novel Deep Reinforcement Learning (DRL) algorithm called Deep Quality-Value (DQV) Learning.

Atari Games Q-Learning +2

Sampled Policy Gradient for Learning to Play the Game Agar.io

2 code implementations15 Sep 2018 Anton Orell Wiehe, Nil Stolt Ansó, Madalina M. Drugan, Marco A. Wiering

In this paper, a new offline actor-critic learning algorithm is introduced: Sampled Policy Gradient (SPG).

Q-Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.