Search Results for author: Norio Kosaka

Found 3 papers, 2 papers with code

Direct Preference-based Policy Optimization without Reward Modeling

1 code implementation • NeurIPS 2023 • Gaon An, Junhyeok Lee, Xingdong Zuo, Norio Kosaka, Kyung-Min Kim, Hyun Oh Song

We apply our algorithm to offline RL tasks with actual human preference labels and show that our algorithm outperforms or is on par with the existing PbRL methods.

Contrastive Learning Offline RL +1

PlaNet of the Bayesians: Reconsidering and Improving Deep Planning Network by Incorporating Bayesian Inference

no code implementations • 1 Mar 2020 • Masashi Okada, Norio Kosaka, Tadahiro Taniguchi

In this paper, we extend VI-MPC and PaETS, which were originally introduced in previous literature, to address partially observable cases.

Bayesian Inference Continuous Control +3
