Search Results for author: Norio Kosaka

Found 3 papers, 2 papers with code

Direct Preference-based Policy Optimization without Reward Modeling

1 code implementation • NeurIPS 2023 • Gaon An, Junhyeok Lee, Xingdong Zuo, Norio Kosaka, Kyung-Min Kim, Hyun Oh Song

We apply our algorithm to offline RL tasks with actual human preference labels and show that our algorithm outperforms or is on par with the existing PbRL methods.

Contrastive Learning Offline RL +1

Paper
Code

Know Your Action Set: Learning Action Relations for Reinforcement Learning

1 code implementation • ICLR 2022 • Ayush Jain, Norio Kosaka, Kyung-Min Kim, Joseph J Lim

Intelligent agents can solve tasks in a variety of ways depending on the action set at their disposal.

Graph Attention Recommendation Systems +2

Paper
Code

PlaNet of the Bayesians: Reconsidering and Improving Deep Planning Network by Incorporating Bayesian Inference

no code implementations • 1 Mar 2020 • Masashi Okada, Norio Kosaka, Tadahiro Taniguchi

In this paper, we extend VI-MPC and PaETS, which have been originally introduced in previous literature, to address partially observable cases.

Bayesian Inference Continuous Control +3

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.