no code implementations • 20 Feb 2024 • Huy Hoang, Tien Mai, Pradeep Varakantham
Most of the existing offline IL methods developed for this setting are based on behavior cloning or distribution matching, where the aim is to match the occupancy distribution of the imitation policy with that of the expert policy.
1 code implementation • 16 Dec 2023 • Huy Hoang, Tien Mai, Pradeep Varakantham
In an exhaustive set of experiments, we demonstrate that our approach is able to outperform top benchmark approaches for solving Constrained RL problems, with respect to expected cost, CVaR cost, or even unknown cost constraints.