Search Results for author: Merwan Barlier

Found 9 papers, 1 papers with code

Enhancing Reinforcement Learning Agents with Local Guides

1 code implementation • 21 Feb 2024 • Paul Daoudi, Bogdan Robu, Christophe Prieur, Ludovic Dos Santos, Merwan Barlier

This paper addresses the problem of integrating local guide policies into a Reinforcement Learning agent.

2,928

Paper
Code

Improving a Proportional Integral Controller with Reinforcement Learning on a Throttle Valve Benchmark

no code implementations • 21 Feb 2024 • Paul Daoudi, Bojan Mavkov, Bogdan Robu, Christophe Prieur, Emmanuel Witrant, Merwan Barlier, Ludovic Dos Santos

This paper presents a learning-based control strategy for non-linear throttle valves with an asymmetric hysteresis, leading to a near-optimal controller without requiring any prior knowledge about the environment.

Reinforcement Learning (RL)

Paper
Add Code

Differentially Private Model-Based Offline Reinforcement Learning

no code implementations • 8 Feb 2024 • Alexandre Rio, Merwan Barlier, Igor Colin, Albert Thomas

We address offline reinforcement learning with privacy guarantees, where the goal is to train a policy that is differentially private with respect to individual trajectories in the dataset.

reinforcement-learning

Paper
Add Code

A Conservative Approach for Few-Shot Transfer in Off-Dynamics Reinforcement Learning

no code implementations • 24 Dec 2023 • Paul Daoudi, Christophe Prieur, Bogdan Robu, Merwan Barlier, Ludovic Dos Santos

In the few-shot framework, a limited number of transitions from the target environment are introduced to facilitate a more effective transfer.

Imitation Learning

Paper
Add Code

Clustered Multi-Agent Linear Bandits

no code implementations • 15 Sep 2023 • Hamza Cherkaoui, Merwan Barlier, Igor Colin

We address in this paper a particular instance of the multi-agent linear stochastic bandit problem, called clustered multi-agent linear bandits.

Clustering

Paper
Add Code

Price of Safety in Linear Best Arm Identification

no code implementations • 15 Sep 2023 • Xuedong Shang, Igor Colin, Merwan Barlier, Hamza Cherkaoui

We introduce the safe best-arm identification framework with linear feedback, where the agent is subject to some stage-wise safety constraint that linearly depends on an unknown parameter vector.

Paper
Add Code

Density Estimation for Conservative Q-Learning

no code implementations • 29 Sep 2021 • Paul Daoudi, Merwan Barlier, Ludovic Dos Santos, Aladin Virmaux

We hence introduce Density Conservative Q-Learning (D-CQL), a batch-RL algorithm with strong theoretical guarantees that carefully penalizes the value function based on the amount of information collected in the state-action space.

Density Estimation Q-Learning

Paper
Add Code

A Simple and Efficient Smoothing Method for Faster Optimization and Local Exploration

no code implementations • NeurIPS 2020 • Kevin Scaman, Ludovic Dos Santos, Merwan Barlier, Igor Colin

This novel smoothing method is then used to improve first-order non-smooth optimization (both convex and non-convex) by allowing for a local exploration of the search space.

Paper
Add Code

Human-Machine Dialogue as a Stochastic Game

no code implementations • WS 2015 • Merwan Barlier, Julien Perolat, Romain Laroche, Olivier Pietquin

Decision Making Multi-agent Reinforcement Learning +3

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.