no code implementations • ICML 2020 • Byung-Jun Lee, Jongmin Lee, Peter Vrancx, Dongho Kim, Kee-Eung Kim
We consider the batch reinforcement learning problem where the agent needs to learn only from a fixed batch of data, without further interaction with the environment.
no code implementations • 18 Jan 2024 • Hee-Jun Ahn, Seong-Woong Shim, Byung-Jun Lee
In offline imitation learning (IL), we generally assume only a handful of expert trajectories and a supplementary offline dataset from suboptimal behaviors to learn the expert policy.
1 code implementation • 24 Oct 2022 • Haanvid Lee, Jongmin Lee, Yunseon Choi, Wonseok Jeon, Byung-Jun Lee, Yung-Kyun Noh, Kee-Eung Kim
We consider local kernel metric learning for off-policy evaluation (OPE) of deterministic policies in contextual bandits with continuous action spaces.
1 code implementation • 21 Jun 2021 • Jongmin Lee, Wonseok Jeon, Byung-Jun Lee, Joelle Pineau, Kee-Eung Kim
We consider the offline reinforcement learning (RL) setting where the agent aims to optimize the policy solely from the data without further environment interactions.
no code implementations • ICLR 2021 • Deunsol Yoon, Sunghoon Hong, Byung-Jun Lee, Kee-Eung Kim
Safe and reliable electricity transmission in power grids is crucial for modern society.
no code implementations • ICLR 2021 • Byung-Jun Lee, Jongmin Lee, Kee-Eung Kim
We present a new objective for model learning motivated by recent advances in the estimation of stationary distribution corrections.