no code implementations • 22 Jul 2023 • Shuwa Miura
We consider the expressivity of Markov rewards in sequential decision making under uncertainty.
1 code implementation • 18 Apr 2023 • Kazumi Kasaura, Shuwa Miura, Tadashi Kozuno, Ryo Yonetani, Kenta Hoshino, Yohei Hosoe
This study presents a benchmark for evaluating action-constrained reinforcement learning (RL) algorithms.
no code implementations • 11 Mar 2017 • Shuwa Miura, Alex Fukunaga
Axioms can be used to model derived predicates in domain- independent planning models.