Search Results for author: Huy Hoang

Found 2 papers, 1 papers with code

SubIQ: Inverse Soft-Q Learning for Offline Imitation with Suboptimal Demonstrations

no code implementations • 20 Feb 2024 • Huy Hoang, Tien Mai, Pradeep Varakantham

Most of the existing offline IL methods developed for this setting are based on behavior cloning or distribution matching, where the aim is to match the occupancy distribution of the imitation policy with that of the expert policy.

Imitation Learning Q-Learning

Paper
Add Code

Imitate the Good and Avoid the Bad: An Incremental Approach to Safe Reinforcement Learning

1 code implementation • 16 Dec 2023 • Huy Hoang, Tien Mai, Pradeep Varakantham

In an exhaustive set of experiments, we demonstrate that our approach is able to outperform top benchmark approaches for solving Constrained RL problems, with respect to expected cost, CVaR cost, or even unknown cost constraints.

Reinforcement Learning (RL) Safe Reinforcement Learning

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.