Search Results for author: Firas Jarboui

Found 8 papers, 0 papers with code

Delayed Geometric Discounts: An Alternative Criterion for Reinforcement Learning

no code implementations • 26 Sep 2022 • Firas Jarboui, Ahmed Akakzia

The endeavor of artificial intelligence (AI) is to design autonomous agents capable of achieving complex tasks.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Unsupervised Neural Hidden Markov Models with a Continuous latent state space

no code implementations • 10 Jun 2021 • Firas Jarboui, Vianney Perchet

We introduce a new procedure to neuralize unsupervised Hidden Markov Models in the continuous case.

Paper
Add Code

Offline Inverse Reinforcement Learning

no code implementations • 9 Jun 2021 • Firas Jarboui, Vianney Perchet

Current solutions either solve a behaviour cloning problem (which does not leverage the exploratory data) or a reinforced imitation learning problem (using a fixed cost function that discriminates available exploratory trajectories from expert ones).

Data Augmentation Imitation Learning +4

Paper
Add Code

Quickest change detection with unknown parameters: Constant complexity and near optimality

no code implementations • 9 Jun 2021 • Firas Jarboui, Viannet Perchet

We consider the quickest change detection problem where both the parameters of pre- and post- change distributions are unknown, which prevents the use of classical simple hypothesis testing.

Change Detection

Paper
Add Code

A Generalised Inverse Reinforcement Learning Framework

no code implementations • 25 May 2021 • Firas Jarboui, Vianney Perchet

The gloabal objective of inverse Reinforcement Learning (IRL) is to estimate the unknown cost function of some MDP base on observed trajectories generated by (approximate) optimal policies.

OpenAI Gym reinforcement-learning +1

Paper
Add Code

Quickest change detection for multi-task problems under unknown parameters

no code implementations • 1 Jan 2021 • Firas Jarboui, Vianney Perchet

We consider the quickest change detection problem where both the parameters of pre- and post- change distributions are unknown, which prevent the use of classical simple hypothesis testing.

Change Detection Two-sample testing

Paper
Add Code

Trajectory representation learning for Multi-Task NMRDPs planning

no code implementations • 25 Sep 2019 • Firas Jarboui, Vianney Perchet, Roman EGGER

Expanding Non Markovian Reward Decision Processes (NMRDP) into Markov Decision Processes (MDP) enables the use of state of the art Reinforcement Learning (RL) techniques to identify optimal policies.

Reinforcement Learning (RL) Representation Learning

Paper
Add Code

Markov Decision Process for MOOC users behavioral inference

no code implementations • 10 Jul 2019 • Firas Jarboui, Célya Gruson-daniel, Pierre Chanial, Alain Durmus, Vincent Rocchisani, Sophie-helene Goulet Ebongue, Anneliese Depoux, Wilfried Kirschenmann, Vianney Perchet

Studies on massive open online courses (MOOCs) users discuss the existence of typical profiles and their impact on the learning process of the students.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.