no code implementations • 12 Mar 2024 • Motoki Omura, Takayuki Osa, Yusuke Mukuta, Tatsuya Harada
In deep reinforcement learning, estimating the value function to evaluate the quality of states and actions is essential.
no code implementations • 1 Mar 2024 • Takayuki Osa, Tatsuya Harada
We demonstrate that policies trained with a popular deep RL method are vulnerable to changes in policies of other agents and that the proposed framework improves the robustness against such changes.
no code implementations • 13 Jul 2021 • Takayuki Osa
The experimental results indicate that the trained model represents an infinite set of homotopic solutions for motion planning problems.
1 code implementation • 12 Mar 2021 • Takayuki Osa, Voot Tangkaratt, Masashi Sugiyama
In our method, a policy conditioned on a continuous or discrete latent variable is trained by directly maximizing the variational lower bound of the mutual information, instead of using the mutual information as unsupervised rewards as in previous studies.
no code implementations • 24 Jul 2020 • Takayuki Osa
The experimental results indicate that the solution manifold can be learned with the proposed algorithm, and the trained model represents an infinite set of homotopic solutions for motion-planning problems.
no code implementations • 4 Jun 2020 • Takuya Hiraoka, Takahisa Imagawa, Voot Tangkaratt, Takayuki Osa, Takashi Onishi, Yoshimasa Tsuruoka
Model-based meta-reinforcement learning (RL) methods have recently been shown to be a promising approach to improving the sample efficiency of RL in multi-task settings.
no code implementations • 9 Dec 2019 • Takayuki Osa, Shuhei Ikemoto
The learned decoder can be used as a motion planner in which the user can specify the goal position and the trajectory types by setting the latent variables.
2 code implementations • 3 Oct 2019 • Johannes Ackermann, Volker Gabler, Takayuki Osa, Masashi Sugiyama
Finally, we investigate the application of multi-agent methods to high-dimensional robotic tasks and show that our approach can be used to learn decentralized policies in this domain.
Multi-agent Reinforcement Learning Reinforcement Learning (RL)
1 code implementation • ICLR 2019 • Takayuki Osa, Voot Tangkaratt, Masashi Sugiyama
However, identifying the hierarchical policy structure that enhances the performance of RL is not a trivial task.
no code implementations • 16 Nov 2018 • Takayuki Osa, Joni Pajarinen, Gerhard Neumann, J. Andrew Bagnell, Pieter Abbeel, Jan Peters
This process of learning from demonstrations, and the study of algorithms to do so, is called imitation learning.
no code implementations • 28 Nov 2017 • Takayuki Osa, Masashi Sugiyama
Learning an optimal policy from a multi-modal reward function is a challenging problem in reinforcement learning (RL).