Search Results for author: Huy Hoang

Found 2 papers, 1 papers with code

SubIQ: Inverse Soft-Q Learning for Offline Imitation with Suboptimal Demonstrations

no code implementations20 Feb 2024 Huy Hoang, Tien Mai, Pradeep Varakantham

Most of the existing offline IL methods developed for this setting are based on behavior cloning or distribution matching, where the aim is to match the occupancy distribution of the imitation policy with that of the expert policy.

Imitation Learning Q-Learning

Imitate the Good and Avoid the Bad: An Incremental Approach to Safe Reinforcement Learning

1 code implementation16 Dec 2023 Huy Hoang, Tien Mai, Pradeep Varakantham

In an exhaustive set of experiments, we demonstrate that our approach is able to outperform top benchmark approaches for solving Constrained RL problems, with respect to expected cost, CVaR cost, or even unknown cost constraints.

Reinforcement Learning (RL) Safe Reinforcement Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.