1 code implementation • 10 Dec 2021 • Giseung Park, Sungho Choi, Youngchul Sung
This paper proposes a new sequential model learning architecture to solve partially observable Markov decision problems.
Partially Observable Reinforcement Learning reinforcement-learning +1
no code implementations • 1 Jan 2021 • Giseung Park, Whiyoung Jung, Sungho Choi, Youngchul Sung
In this paper, we consider intrinsic reward generation for sparse-reward reinforcement learning based on model prediction errors.
1 code implementation • ICLR 2020 • Whiyoung Jung, Giseung Park, Youngchul Sung
In the proposed scheme, multiple identical learners with their own value-functions and policies share a common experience replay buffer, and search a good policy in collaboration with the guidance of the best policy information.
no code implementations • 25 Sep 2019 • Giseung Park, Whiyoung Jung, Sungho Choi, Youngchul Sung
In this paper, a new intrinsic reward generation method for sparse-reward reinforcement learning is proposed based on an ensemble of dynamics models.
no code implementations • 27 Sep 2018 • Whiyoung Jung, Giseung Park, Youngchul Sung
In this paper, a new interactive parallel learning scheme is proposed to enhance the performance of off-policy continuous-action reinforcement learning.