Search Results for author: Sheelabhadra Dey

Found 1 paper, 1 paper with code

A Joint Imitation-Reinforcement Learning Framework for Reduced Baseline Regret

1 code implementation • 20 Sep 2022 • Sheelabhadra Dey, Sumedh Pendurkar, Guni Sharon, Josiah P. Hanna

The learning process in JIRL (Joint Imitation-Reinforcement Learning) assumes the availability of a baseline policy and is designed with two objectives in mind: (a) leveraging the baseline's online demonstrations to minimize the regret w.r.t. the baseline policy during training, and (b) eventually surpassing the baseline performance.
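
One plausible reading of "regret w.r.t. the baseline policy" is the cumulative shortfall in return of the learner relative to the baseline over the training episodes. The sketch below illustrates that reading only; the function name `baseline_regret` and its inputs are hypothetical, and the paper's exact definition may differ.

```python
from typing import Sequence


def baseline_regret(baseline_return: float, learner_returns: Sequence[float]) -> float:
    """Sum of (baseline return - learner return) over training episodes.

    Assumption: the baseline's expected return is a known constant and each
    entry of learner_returns is the return of one training episode.
    A negative term means the learner outperformed the baseline in that episode.
    """
    return sum(baseline_return - r for r in learner_returns)


# Hypothetical numbers: the learner starts below the baseline and ends above it.
example_regret = baseline_regret(10.0, [4.0, 7.0, 10.0, 12.0])
```

Under this reading, objective (a) corresponds to keeping the running sum small during training, and objective (b) corresponds to the per-episode terms eventually turning negative.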

Reinforcement Learning (RL)
