Search Results for author: Othman Gaizi

Found 3 papers, 1 papers with code

Conservative Exploration for Policy Optimization via Off-Policy Policy Evaluation

no code implementations • 24 Dec 2023 • Paul Daoudi, Mathias Formoso, Othman Gaizi, Achraf Azize, Evrard Garcelon

A precondition for the deployment of a Reinforcement Learning agent to a real-world system is to provide guarantees on the learning process.

Paper
Add Code

The guide and the explorer: smart agents for resource-limited iterated batch reinforcement learning

no code implementations • 29 Sep 2021 • Albert Thomas, Balázs Kégl, Othman Gaizi, Gabriel Hurtado

Iterated batch reinforcement learning (RL) is a growing subfield fueled by the demand from systems engineers for intelligent control solutions that they can apply within their technical and organizational constraints.

Acrobot Model Predictive Control +1

Paper
Add Code

Conservative Optimistic Policy Optimization via Multiple Importance Sampling

1 code implementation • 4 Mar 2021 • Achraf Azize, Othman Gaizi

Reinforcement Learning (RL) has been able to solve hard problems such as playing Atari games or solving the game of Go, with a unified approach.

Atari Games Game of Go +2

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.