Search Results for author: Wonjoon Goo

Found 8 papers, 5 papers with code

Know Your Boundaries: The Necessity of Explicit Behavioral Cloning in Offline RL

no code implementations1 Jun 2022 Wonjoon Goo, Scott Niekum

In this work, we argue that it is not only viable but beneficial to explicitly model the behavior policy for offline RL because the constraint can be realized in a stable way with the trained model.

D4RL Offline RL +1

A Ranking Game for Imitation Learning

no code implementations7 Feb 2022 Harshit Sikchi, Akanksha Saran, Wonjoon Goo, Scott Niekum

We propose a new framework for imitation learning -- treating imitation as a two-player ranking-based game between a policy and a reward.

Imitation Learning

You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL

no code implementations5 Oct 2021 Wonjoon Goo, Scott Niekum

The goal of offline reinforcement learning (RL) is to find an optimal policy given prerecorded trajectories.

D4RL Offline RL +1

Self-Supervised Online Reward Shaping in Sparse-Reward Environments

1 code implementation8 Mar 2021 Farzan Memarian, Wonjoon Goo, Rudolf Lioutikov, Scott Niekum, Ufuk Topcu

We introduce Self-supervised Online Reward Shaping (SORS), which aims to improve the sample efficiency of any RL algorithm in sparse-reward environments by automatically densifying rewards.

Local Nonparametric Meta-Learning

1 code implementation9 Feb 2020 Wonjoon Goo, Scott Niekum

A central goal of meta-learning is to find a learning rule that enables fast adaptation across a set of tasks, by learning the appropriate inductive bias for that set.

Inductive Bias Meta-Learning

Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations

3 code implementations12 Apr 2019 Daniel S. Brown, Wonjoon Goo, Prabhat Nagarajan, Scott Niekum

A critical flaw of existing inverse reinforcement learning (IRL) methods is their inability to significantly outperform the demonstrator.

Imitation Learning reinforcement-learning +1

One-Shot Learning of Multi-Step Tasks from Observation via Activity Localization in Auxiliary Video

1 code implementation29 Jun 2018 Wonjoon Goo, Scott Niekum

Due to burdensome data requirements, learning from demonstration often falls short of its promise to allow users to quickly and naturally program robots.

One-Shot Learning Task 2

Cannot find the paper you are looking for? You can Submit a new open access paper.