no code implementations • 18 Mar 2021 • Nir Baram, Guy Tennenholtz, Shie Mannor
However, using mixture policies in the Maximum Entropy (MaxEnt) framework is not straightforward.
no code implementations • 22 Feb 2021 • Nir Baram, Guy Tennenholtz, Shie Mannor
Maximum Entropy (MaxEnt) reinforcement learning is a powerful learning paradigm which seeks to maximize return under entropy regularization.
no code implementations • 25 Sep 2019 • Nir Baram, Shie Mannor
Model-based imitation learning methods require full knowledge of the transition kernel for policy evaluation.
no code implementations • 16 Sep 2018 • Nir Baram, Shie Mannor
We denote this setup as \textit{Inspiration Learning} - knowledge transfer between agents that operate in different action spaces.
no code implementations • ICML 2017 • Nir Baram, Oron Anschel, Itai Caspi, Shie Mannor
Generative Adversarial Networks (GANs) have been successfully applied to the problem of policy imitation in a model-free setup.
no code implementations • 7 Dec 2016 • Nir Baram, Oron Anschel, Shie Mannor
A model-based approach for the problem of adversarial imitation learning.
no code implementations • ICML 2017 • Oron Anschel, Nir Baram, Nahum Shimkin
Instability and variability of Deep Reinforcement Learning (DRL) algorithms tend to adversely affect their performance.
no code implementations • 16 Jun 2016 • Nir Baram, Tom Zahavy, Shie Mannor
Deep Reinforcement Learning (DRL) is a trending field of research, showing great promise in challenging problems such as playing Atari, solving Go and controlling robots.