Search Results for author: Albert Thomas

Found 13 papers, 1 papers with code

Model-based micro-data reinforcement learning: what are the crucial model properties and which model to choose?

1 code implementation • ICLR 2021 • Balázs Kégl, Gabriel Hurtado, Albert Thomas

We contribute to micro-data model-based reinforcement learning (MBRL) by rigorously comparing popular generative models using a fixed (random shooting) control agent.

Acrobot Model-based Reinforcement Learning

Paper
Code

Mass Volume Curves and Anomaly Ranking

no code implementations • 3 May 2017 • Stephan Clémençon, Albert Thomas

Here we formulate the issue of scoring anomalies as a M-estimation problem by means of a novel functional performance criterion, referred to as the Mass Volume curve (MV curve in short), whose optimal elements are strictly increasing transforms of the density almost everywhere on the support of the density.

Generalization Bounds

Paper
Add Code

Calibration of One-Class SVM for MV set estimation

no code implementations • 30 Aug 2015 • Albert Thomas, Vincent Feuillard, Alexandre Gramfort

Our approach makes it possible to tune the hyperparameters automatically and obtain nested set estimates.

Anomaly Detection Novelty Detection

Paper
Add Code

Parallel Contextual Bandits in Wireless Handover Optimization

no code implementations • 21 Jan 2019 • Igor Colin, Albert Thomas, Moez Draief

As cellular networks become denser, a scalable and dynamic tuning of wireless base station parameters can only be achieved through automated optimization.

Multi-Armed Bandits Thompson Sampling

Paper
Add Code

Best Arm Identification in Graphical Bilinear Bandits

no code implementations • 14 Dec 2020 • Geovani Rizk, Albert Thomas, Igor Colin, Rida Laraki, Yann Chevaleyre

We study the best arm identification problem in which the learner wants to find the graph allocation maximizing the sum of the bilinear rewards.

Paper
Add Code

Refined bounds for randomized experimental design

no code implementations • 22 Dec 2020 • Geovani Rizk, Igor Colin, Albert Thomas, Moez Draief

Experimental design is an approach for selecting samples among a given set so as to obtain the best estimator for a given criterion.

Experimental Design

Paper
Add Code

The guide and the explorer: smart agents for resource-limited iterated batch reinforcement learning

no code implementations • 29 Sep 2021 • Albert Thomas, Balázs Kégl, Othman Gaizi, Gabriel Hurtado

Iterated batch reinforcement learning (RL) is a growing subfield fueled by the demand from systems engineers for intelligent control solutions that they can apply within their technical and organizational constraints.

Acrobot Model Predictive Control +1

Paper
Add Code

An $α$-No-Regret Algorithm For Graphical Bilinear Bandits

no code implementations • 1 Jun 2022 • Geovani Rizk, Igor Colin, Albert Thomas, Rida Laraki, Yann Chevaleyre

We propose the first regret-based approach to the Graphical Bilinear Bandits problem, where $n$ agents in a graph play a stochastic bilinear bandit game with each of their neighbors.

Paper
Add Code

Guided Safe Shooting: model based reinforcement learning with safety constraints

no code implementations • 20 Jun 2022 • Giuseppe Paolo, Jonas Gonzalez-Billandon, Albert Thomas, Balázs Kégl

In the last decade, reinforcement learning successfully solved complex control tasks and decision-making problems, like the Go board game.

Decision Making Model-based Reinforcement Learning +2

Paper
Add Code

Multi-timestep models for Model-based Reinforcement Learning

no code implementations • 9 Oct 2023 • Abdelhakim Benechehab, Giuseppe Paolo, Albert Thomas, Maurizio Filippone, Balázs Kégl

In model-based reinforcement learning (MBRL), most algorithms rely on simulating trajectories from one-step dynamics models learned on data.

Model-based Reinforcement Learning reinforcement-learning +1

Paper
Add Code

A Multi-step Loss Function for Robust Learning of the Dynamics in Model-based Reinforcement Learning

no code implementations • 5 Feb 2024 • Abdelhakim Benechehab, Albert Thomas, Giuseppe Paolo, Maurizio Filippone, Balázs Kégl

In model-based reinforcement learning, most algorithms rely on simulating trajectories from one-step models of the dynamics learned on data.

Future prediction Model-based Reinforcement Learning +1

Paper
Add Code

Deep autoregressive density nets vs neural ensembles for model-based offline reinforcement learning

no code implementations • 5 Feb 2024 • Abdelhakim Benechehab, Albert Thomas, Balázs Kégl

We consider the problem of offline reinforcement learning where only a set of system transitions is made available for policy optimization.

D4RL Model-based Reinforcement Learning +1

Paper
Add Code

Differentially Private Model-Based Offline Reinforcement Learning

no code implementations • 8 Feb 2024 • Alexandre Rio, Merwan Barlier, Igor Colin, Albert Thomas

We address offline reinforcement learning with privacy guarantees, where the goal is to train a policy that is differentially private with respect to individual trajectories in the dataset.

reinforcement-learning

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.