1 code implementation • 30 Aug 2023 • Jasmina Gajcin, James McCarthy, Rahul Nair, Radu Marinescu, Elizabeth Daly, Ivana Dusparic
Our approach allows the user to provide trajectory-level feedback on the agent's behavior during training, which can be integrated as a reward shaping signal in the following training iteration.
no code implementations • 18 Jul 2022 • James McCarthy, Rahul Nair, Elizabeth Daly, Radu Marinescu, Ivana Dusparic
Explainability of Reinforcement Learning (RL) policies remains a challenging research problem, particularly when considering RL in a safety context.