no code implementations • 14 Jun 2024 • Paul-Antoine Le Tolguenec, Yann Besse, Florent Teichteil-Konigsbuch, Dennis G. Wilson, Emmanuel Rachelson
We propose a formalization of this search for diverse skills, building on a previous definition based on the mutual information between states and skills.
1 code implementation • 12 Jun 2024 • Adil Zouitine, David Bertoin, Pierre Clavier, Matthieu Geist, Emmanuel Rachelson
We introduce the Robust Reinforcement Learning Suite (RRLS), a benchmark suite based on Mujoco environments.
no code implementations • 12 Jun 2024 • Adil Zouitine, David Bertoin, Pierre Clavier, Matthieu Geist, Emmanuel Rachelson
Robust reinforcement learning is essential for deploying reinforcement learning algorithms in real-world scenarios where environmental uncertainty predominates.
no code implementations • 6 Jun 2024 • Pierre Clavier, Emmanuel Rachelson, Erwan Le Pennec, Matthieu Geist
On robust RL benchmarks, involving changes of the environment, we show that our approach is more robust than classic RL algorithms.
no code implementations • 7 May 2024 • Paul Templier, Emmanuel Rachelson, Antoine Cully, Dennis G. Wilson
Here, we highlight the phenomenon of genetic drift where the actor genome and the ES population distribution progressively drift apart, leading to injection having a negative impact on the ES.
no code implementations • 7 May 2024 • Paul Templier, Luca Grillotti, Emmanuel Rachelson, Dennis G. Wilson, Antoine Cully
Evolution Strategies (ES) are effective gradient-free optimization methods that can be competitive with gradient-based approaches for policy search.
no code implementations • 7 Dec 2022 • Paul-Antoine Le Tolguenec, Emmanuel Rachelson, Yann Besse, Dennis G. Wilson
In this work, we use a recently proposed definition of intrinsic motivation, Curiosity, in an evolutionary policy search method.
no code implementations • 5 Oct 2022 • Valentin Guillet, Dennis G. Wilson, Carlos Aguilar-Melchor, Emmanuel Rachelson
Learning a good state representation is a critical skill when dealing with multiple tasks in Reinforcement Learning as it allows for transfer and better generalization between tasks.
no code implementations • 5 Oct 2022 • Valentin Guillet, Dennis G. Wilson, Carlos Aguilar-Melchor, Emmanuel Rachelson
Although transfer learning is considered to be a milestone in deep reinforcement learning, the mechanisms behind it are still poorly understood.
1 code implementation • 16 Sep 2022 • David Bertoin, Adil Zouitine, Mehdi Zouitine, Emmanuel Rachelson
This implies that training of a policy with small generalization gap should focus on such important pixels and ignore the others.
no code implementations • ICLR 2022 • David Bertoin, Emmanuel Rachelson
We also demonstrate the effectiveness of CLOP as a general regularization technique in supervised learning.
1 code implementation • 17 Feb 2022 • Kaitlin Maile, Emmanuel Rachelson, Hervé Luga, Dennis G. Wilson
Neurogenesis in ANNs is an understudied and difficult problem, even compared to other forms of structural learning like pruning.
1 code implementation • 24 Dec 2021 • David Bertoin, Emmanuel Rachelson
This enables the isolation of task-specific information from both domains and a projection into a common representation.
2 code implementations • 4 Oct 2021 • Thibault Lahire, Matthieu Geist, Emmanuel Rachelson
The optimal sampling distribution being intractable, we make several approximations providing good results in practice and introduce, among others, LaBER (Large Batch Experience Replay), an easy-to-code and efficient method for sampling the replay buffer.
no code implementations • 1 Jan 2021 • David Bertoin, Emmanuel Rachelson
The domain adaptation problem involves learning a unique classification or regres-sion model capable of performing on both a source and a target domain.
1 code implementation • 15 Jan 2020 • Erwan Lecarpentier, David Abel, Kavosh Asadi, Yuu Jinnai, Emmanuel Rachelson, Michael L. Littman
We consider the problem of knowledge transfer when an agent is facing a series of Reinforcement Learning (RL) tasks.
1 code implementation • NeurIPS 2019 • Erwan Lecarpentier, Emmanuel Rachelson
time; 2) we consider a planning agent using the current model of the environment but unaware of its future evolution.
Model-based Reinforcement Learning Reinforcement Learning (RL)
no code implementations • 12 Jul 2019 • Andrea Lodi, Luca Mossina, Emmanuel Rachelson
Although presented through the application to the facility location problem, the approach developed here is general and explores a new perspective on the exploitation of past experience in combinatorial optimization.
no code implementations • 9 May 2019 • Luca Mossina, Emmanuel Rachelson, Daniel Delahaye
We study how Reinforcement Learning can be employed to optimally control parameters in evolutionary algorithms.
no code implementations • 22 Apr 2019 • Erwan Lecarpentier, Emmanuel Rachelson
time; 2) we consider a planning agent using the current model of the environment but unaware of its future evolution.
no code implementations • 3 May 2018 • Erwan Lecarpentier, Guillaume Infantes, Charles Lesire, Emmanuel Rachelson
In the context of tree-search stochastic planning algorithms where a generative model is available, we consider on-line planning algorithms building trees in order to recommend an action.
1 code implementation • 19 Jul 2017 • Luca Mossina, Emmanuel Rachelson
This article focuses on the question of learning how to automatically select a subset of items among a bigger set.
no code implementations • 18 Jul 2017 • Erwan Lecarpentier, Sebastian Rapp, Marc Melo, Emmanuel Rachelson
Autonomous unpowered flight is a challenge for control and guidance systems: all the energy the aircraft might use during flight has to be harvested directly from the atmosphere.