Search Results for author: Soichiro Nishimori

Found 4 papers, 2 papers with code

A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees

1 code implementation • 31 Jan 2024 • Toshinori Kitamura, Tadashi Kozuno, Masahiro Kato, Yuki Ichihara, Soichiro Nishimori, Akiyoshi Sannai, Sho Sonoda, Wataru Kumagai, Yutaka Matsuo

We study a primal-dual reinforcement learning (RL) algorithm for the online constrained Markov decision processes (CMDP) problem, wherein the agent explores an optimal policy that maximizes return while satisfying constraints.

Reinforcement Learning (RL)

End-to-End Policy Gradient Method for POMDPs and Explainable Agents

no code implementations • 19 Apr 2023 • Soichiro Nishimori, Sotetsu Koyamada, Shin Ishii

We propose an RL algorithm that estimates the hidden states by end-to-end training, and visualize the estimation as a state-transition graph.

Autonomous Driving, Decision Making +2
