Search Results for author: Othman Gaizi

Found 3 papers, 1 papers with code

Conservative Exploration for Policy Optimization via Off-Policy Policy Evaluation

no code implementations24 Dec 2023 Paul Daoudi, Mathias Formoso, Othman Gaizi, Achraf Azize, Evrard Garcelon

A precondition for the deployment of a Reinforcement Learning agent to a real-world system is to provide guarantees on the learning process.

The guide and the explorer: smart agents for resource-limited iterated batch reinforcement learning

no code implementations29 Sep 2021 Albert Thomas, Balázs Kégl, Othman Gaizi, Gabriel Hurtado

Iterated batch reinforcement learning (RL) is a growing subfield fueled by the demand from systems engineers for intelligent control solutions that they can apply within their technical and organizational constraints.

Acrobot Model Predictive Control +1

Conservative Optimistic Policy Optimization via Multiple Importance Sampling

1 code implementation4 Mar 2021 Achraf Azize, Othman Gaizi

Reinforcement Learning (RL) has been able to solve hard problems such as playing Atari games or solving the game of Go, with a unified approach.

Atari Games Game of Go +2

Cannot find the paper you are looking for? You can Submit a new open access paper.