Search Results for author: Merwan Barlier

Found 9 papers, 1 papers with code

Enhancing Reinforcement Learning Agents with Local Guides

1 code implementation21 Feb 2024 Paul Daoudi, Bogdan Robu, Christophe Prieur, Ludovic Dos Santos, Merwan Barlier

This paper addresses the problem of integrating local guide policies into a Reinforcement Learning agent.

reinforcement-learning

Improving a Proportional Integral Controller with Reinforcement Learning on a Throttle Valve Benchmark

no code implementations21 Feb 2024 Paul Daoudi, Bojan Mavkov, Bogdan Robu, Christophe Prieur, Emmanuel Witrant, Merwan Barlier, Ludovic Dos Santos

This paper presents a learning-based control strategy for non-linear throttle valves with an asymmetric hysteresis, leading to a near-optimal controller without requiring any prior knowledge about the environment.

Reinforcement Learning (RL)

Differentially Private Model-Based Offline Reinforcement Learning

no code implementations8 Feb 2024 Alexandre Rio, Merwan Barlier, Igor Colin, Albert Thomas

We address offline reinforcement learning with privacy guarantees, where the goal is to train a policy that is differentially private with respect to individual trajectories in the dataset.

reinforcement-learning

A Conservative Approach for Few-Shot Transfer in Off-Dynamics Reinforcement Learning

no code implementations24 Dec 2023 Paul Daoudi, Christophe Prieur, Bogdan Robu, Merwan Barlier, Ludovic Dos Santos

In the few-shot framework, a limited number of transitions from the target environment are introduced to facilitate a more effective transfer.

Imitation Learning

Clustered Multi-Agent Linear Bandits

no code implementations15 Sep 2023 Hamza Cherkaoui, Merwan Barlier, Igor Colin

We address in this paper a particular instance of the multi-agent linear stochastic bandit problem, called clustered multi-agent linear bandits.

Clustering

Price of Safety in Linear Best Arm Identification

no code implementations15 Sep 2023 Xuedong Shang, Igor Colin, Merwan Barlier, Hamza Cherkaoui

We introduce the safe best-arm identification framework with linear feedback, where the agent is subject to some stage-wise safety constraint that linearly depends on an unknown parameter vector.

Density Estimation for Conservative Q-Learning

no code implementations29 Sep 2021 Paul Daoudi, Merwan Barlier, Ludovic Dos Santos, Aladin Virmaux

We hence introduce Density Conservative Q-Learning (D-CQL), a batch-RL algorithm with strong theoretical guarantees that carefully penalizes the value function based on the amount of information collected in the state-action space.

Density Estimation Q-Learning

A Simple and Efficient Smoothing Method for Faster Optimization and Local Exploration

no code implementations NeurIPS 2020 Kevin Scaman, Ludovic Dos Santos, Merwan Barlier, Igor Colin

This novel smoothing method is then used to improve first-order non-smooth optimization (both convex and non-convex) by allowing for a local exploration of the search space.

Cannot find the paper you are looking for? You can Submit a new open access paper.