no code implementations • 1 Mar 2020 • Masashi Okada, Norio Kosaka, Tadahiro Taniguchi
In this paper, we extend VI-MPC and PaETS, which have been originally introduced in previous literature, to address partially observable cases.
1 code implementation • ICLR 2022 • Ayush Jain, Norio Kosaka, Kyung-Min Kim, Joseph J Lim
Intelligent agents can solve tasks in a variety of ways depending on the action set at their disposal.
1 code implementation • NeurIPS 2023 • Gaon An, Junhyeok Lee, Xingdong Zuo, Norio Kosaka, Kyung-Min Kim, Hyun Oh Song
We apply our algorithm to offline RL tasks with actual human preference labels and show that our algorithm outperforms or is on par with the existing PbRL methods.