no code implementations • 9 Dec 2023 • Jaeuk Shin, Giho Kim, Howon Lee, Joonho Han, Insoon Yang
Designing a competent meta-reinforcement learning (meta-RL) algorithm in terms of data usage remains a central challenge to be tackled for its successful real-world applications.
no code implementations • 28 Nov 2022 • MinGyu Park, Jaeuk Shin, Insoon Yang
Inspired by the quasi-Newton interpretation of AA, we propose a maximum entropy variant of QMDP, which we call soft QMDP, to fully benefit from AA.
1 code implementation • 27 Oct 2020 • Jeongho Kim, Jaeuk Shin, Insoon Yang
In this paper, we propose Q-learning algorithms for continuous-time deterministic optimal control problems with Lipschitz continuous controls.