Search Results for author: Firas Jarboui

Found 8 papers, 0 papers with code

Delayed Geometric Discounts: An Alternative Criterion for Reinforcement Learning

no code implementations26 Sep 2022 Firas Jarboui, Ahmed Akakzia

The endeavor of artificial intelligence (AI) is to design autonomous agents capable of achieving complex tasks.

reinforcement-learning Reinforcement Learning (RL)

Unsupervised Neural Hidden Markov Models with a Continuous latent state space

no code implementations10 Jun 2021 Firas Jarboui, Vianney Perchet

We introduce a new procedure to neuralize unsupervised Hidden Markov Models in the continuous case.

Offline Inverse Reinforcement Learning

no code implementations9 Jun 2021 Firas Jarboui, Vianney Perchet

Current solutions either solve a behaviour cloning problem (which does not leverage the exploratory data) or a reinforced imitation learning problem (using a fixed cost function that discriminates available exploratory trajectories from expert ones).

Data Augmentation Imitation Learning +4

Quickest change detection with unknown parameters: Constant complexity and near optimality

no code implementations9 Jun 2021 Firas Jarboui, Viannet Perchet

We consider the quickest change detection problem where both the parameters of pre- and post- change distributions are unknown, which prevents the use of classical simple hypothesis testing.

Change Detection

A Generalised Inverse Reinforcement Learning Framework

no code implementations25 May 2021 Firas Jarboui, Vianney Perchet

The gloabal objective of inverse Reinforcement Learning (IRL) is to estimate the unknown cost function of some MDP base on observed trajectories generated by (approximate) optimal policies.

OpenAI Gym reinforcement-learning +1

Quickest change detection for multi-task problems under unknown parameters

no code implementations1 Jan 2021 Firas Jarboui, Vianney Perchet

We consider the quickest change detection problem where both the parameters of pre- and post- change distributions are unknown, which prevent the use of classical simple hypothesis testing.

Change Detection Two-sample testing

Trajectory representation learning for Multi-Task NMRDPs planning

no code implementations25 Sep 2019 Firas Jarboui, Vianney Perchet, Roman EGGER

Expanding Non Markovian Reward Decision Processes (NMRDP) into Markov Decision Processes (MDP) enables the use of state of the art Reinforcement Learning (RL) techniques to identify optimal policies.

Reinforcement Learning (RL) Representation Learning

Markov Decision Process for MOOC users behavioral inference

no code implementations10 Jul 2019 Firas Jarboui, Célya Gruson-daniel, Pierre Chanial, Alain Durmus, Vincent Rocchisani, Sophie-helene Goulet Ebongue, Anneliese Depoux, Wilfried Kirschenmann, Vianney Perchet

Studies on massive open online courses (MOOCs) users discuss the existence of typical profiles and their impact on the learning process of the students.

Cannot find the paper you are looking for? You can Submit a new open access paper.