no code implementations • 26 Mar 2024 • Marius Captari, Remo Sasso, Matthia Sabatelli
While more sophisticated exploration strategies can excel in specific, often sparse reward environments, existing simpler approaches, such as $\epsilon$-greedy, persist in outperforming them across a broader spectrum of domains.
1 code implementation • 12 Jan 2024 • Tsegaye Misikir Tashu, Eduard-Raul Kontos, Matthia Sabatelli, Matias Valdenegro-Toro
Recommendation systems, for documents, have become tools to find relevant content on the Web.
no code implementations • 21 Jul 2023 • Arthur Müller, Matthia Sabatelli
Subsequently, we evaluated the performance of the two methods on a separate model of the same intersection that was developed with a different traffic simulator.
1 code implementation • 11 Oct 2022 • Julius Wagenbach, Matthia Sabatelli
We study whether the learning rate $\alpha$, the discount factor $\gamma$ and the reward signal $r$ have an influence on the overestimation bias of the Q-Learning algorithm.
no code implementations • 7 Sep 2022 • Matias Valdenegro-Toro, Matthia Sabatelli
Overfitting and generalization is an important concept in Machine Learning as only models that generalize are interesting for general applications.
1 code implementation • 9 Aug 2022 • Vincent Tonkes, Matthia Sabatelli
Vision Transformers (VTs) are becoming a valuable alternative to Convolutional Neural Networks (CNNs) when it comes to problems involving high-dimensional and spatially organized inputs such as images.
no code implementations • 21 Jun 2022 • Arthur Müller, Matthia Sabatelli
We present an approach to ensure safety in a real-world intersection by using an action space that is safe by design.
no code implementations • 28 May 2022 • Remo Sasso, Matthia Sabatelli, Marco A. Wiering
A crucial challenge in reinforcement learning is to reduce the number of interactions with the environment that an agent requires to master a given task.
no code implementations • 6 Oct 2021 • Matthia Sabatelli, Pierre Geurts
Transfer Learning (TL) is an efficient machine learning paradigm that allows overcoming some of the hurdles that characterize the successful training of deep neural networks, ranging from long training times to the needs of large datasets.
no code implementations • 14 Aug 2021 • Remo Sasso, Matthia Sabatelli, Marco A. Wiering
Reinforcement learning (RL) is well known for requiring large amounts of data in order for RL agents to learn to perform complex tasks.
Model-based Reinforcement Learning reinforcement-learning +2
1 code implementation • 22 Dec 2020 • Pascal Leroy, Damien Ernst, Pierre Geurts, Gilles Louppe, Jonathan Pisane, Matthia Sabatelli
This paper introduces four new algorithms that can be used for tackling multi-agent reinforcement learning (MARL) problems occurring in cooperative settings.
no code implementations • 11 May 2020 • Matthia Sabatelli, Mike Kestemont, Pierre Geurts
We study the generalization properties of pruned neural networks that are the winners of the lottery ticket hypothesis on datasets of natural images.
3 code implementations • 1 Sep 2019 • Matthia Sabatelli, Gilles Louppe, Pierre Geurts, Marco A. Wiering
This paper makes one step forward towards characterizing a new family of \textit{model-free} Deep Reinforcement Learning (DRL) algorithms.
3 code implementations • 30 Sep 2018 • Matthia Sabatelli, Gilles Louppe, Pierre Geurts, Marco A. Wiering
We introduce a novel Deep Reinforcement Learning (DRL) algorithm called Deep Quality-Value (DQV) Learning.