Search Results for author: Ludovic Dos Santos

Found 9 papers, 2 papers with code

Theoretical Limits of Pipeline Parallel Optimization and Application to Distributed Deep Learning

no code implementations NeurIPS 2019 Igor Colin, Ludovic Dos Santos, Kevin Scaman

For smooth convex and non-convex objective functions, we provide matching lower and upper complexity bounds and show that a naive pipeline parallelization of Nesterov's accelerated gradient descent is optimal.

Coloring graph neural networks for node disambiguation

no code implementations12 Dec 2019 George Dasoulas, Ludovic Dos Santos, Kevin Scaman, Aladin Virmaux

In this paper, we show that a simple coloring scheme can improve, both theoretically and empirically, the expressive power of Message Passing Neural Networks(MPNNs).

Graph Classification

A Simple and Efficient Smoothing Method for Faster Optimization and Local Exploration

no code implementations NeurIPS 2020 Kevin Scaman, Ludovic Dos Santos, Merwan Barlier, Igor Colin

This novel smoothing method is then used to improve first-order non-smooth optimization (both convex and non-convex) by allowing for a local exploration of the search space.

Density Estimation for Conservative Q-Learning

no code implementations29 Sep 2021 Paul Daoudi, Merwan Barlier, Ludovic Dos Santos, Aladin Virmaux

We hence introduce Density Conservative Q-Learning (D-CQL), a batch-RL algorithm with strong theoretical guarantees that carefully penalizes the value function based on the amount of information collected in the state-action space.

Density Estimation Q-Learning

Unifying GANs and Score-Based Diffusion as Generative Particle Models

1 code implementation NeurIPS 2023 Jean-Yves Franceschi, Mike Gartrell, Ludovic Dos Santos, Thibaut Issenhuth, Emmanuel de Bézenac, Mickaël Chen, Alain Rakotomamonjy

Particle-based deep generative models, such as gradient flows and score-based diffusion models, have recently gained traction thanks to their striking performance.

AdBooster: Personalized Ad Creative Generation using Stable Diffusion Outpainting

no code implementations8 Sep 2023 Veronika Shilova, Ludovic Dos Santos, Flavian vasile, Gaëtan Racic, Ugo Tanielian

In digital advertising, the selection of the optimal item (recommendation) and its best creative presentation (creative optimization) have traditionally been considered separate disciplines.

Data Augmentation

A Conservative Approach for Few-Shot Transfer in Off-Dynamics Reinforcement Learning

no code implementations24 Dec 2023 Paul Daoudi, Christophe Prieur, Bogdan Robu, Merwan Barlier, Ludovic Dos Santos

In the few-shot framework, a limited number of transitions from the target environment are introduced to facilitate a more effective transfer.

Imitation Learning

Enhancing Reinforcement Learning Agents with Local Guides

1 code implementation21 Feb 2024 Paul Daoudi, Bogdan Robu, Christophe Prieur, Ludovic Dos Santos, Merwan Barlier

This paper addresses the problem of integrating local guide policies into a Reinforcement Learning agent.

reinforcement-learning

Improving a Proportional Integral Controller with Reinforcement Learning on a Throttle Valve Benchmark

no code implementations21 Feb 2024 Paul Daoudi, Bojan Mavkov, Bogdan Robu, Christophe Prieur, Emmanuel Witrant, Merwan Barlier, Ludovic Dos Santos

This paper presents a learning-based control strategy for non-linear throttle valves with an asymmetric hysteresis, leading to a near-optimal controller without requiring any prior knowledge about the environment.

Reinforcement Learning (RL)

Cannot find the paper you are looking for? You can Submit a new open access paper.