Search Results for author: Takayuki Osa

Found 11 papers, 3 papers with code

Symmetric Q-learning: Reducing Skewness of Bellman Error in Online Reinforcement Learning

no code implementations • 12 Mar 2024 • Motoki Omura, Takayuki Osa, Yusuke Mukuta, Tatsuya Harada

In deep reinforcement learning, estimating the value function to evaluate the quality of states and actions is essential.

Continuous Control Q-Learning +1

Paper
Add Code

Robustifying a Policy in Multi-Agent RL with Diverse Cooperative Behaviors and Adversarial Style Sampling for Assistive Tasks

no code implementations • 1 Mar 2024 • Takayuki Osa, Tatsuya Harada

We demonstrate that policies trained with a popular deep RL method are vulnerable to changes in policies of other agents and that the proposed framework improves the robustness against such changes.

Reinforcement Learning (RL)

Paper
Add Code

Motion Planning by Learning the Solution Manifold in Trajectory Optimization

no code implementations • 13 Jul 2021 • Takayuki Osa

The experimental results indicate that the trained model represents an infinite set of homotopic solutions for motion planning problems.

Motion Planning

Paper
Add Code

Discovering Diverse Solutions in Deep Reinforcement Learning by Maximizing State-Action-Based Mutual Information

1 code implementation • 12 Mar 2021 • Takayuki Osa, Voot Tangkaratt, Masashi Sugiyama

In our method, a policy conditioned on a continuous or discrete latent variable is trained by directly maximizing the variational lower bound of the mutual information, instead of using the mutual information as unsupervised rewards as in previous studies.

Continuous Control

Paper
Code

Learning the Solution Manifold in Optimization and Its Application in Motion Planning

no code implementations • 24 Jul 2020 • Takayuki Osa

The experimental results indicate that the solution manifold can be learned with the proposed algorithm, and the trained model represents an infinite set of homotopic solutions for motion-planning problems.

Density Estimation Motion Planning

Paper
Add Code

Meta-Model-Based Meta-Policy Optimization

no code implementations • 4 Jun 2020 • Takuya Hiraoka, Takahisa Imagawa, Voot Tangkaratt, Takayuki Osa, Takashi Onishi, Yoshimasa Tsuruoka

Model-based meta-reinforcement learning (RL) methods have recently been shown to be a promising approach to improving the sample efficiency of RL in multi-task settings.

Continuous Control Meta-Learning +3

Paper
Add Code

Goal-Conditioned Variational Autoencoder Trajectory Primitives with Continuous and Discrete Latent Codes

no code implementations • 9 Dec 2019 • Takayuki Osa, Shuhei Ikemoto

The learned decoder can be used as a motion planner in which the user can specify the goal position and the trajectory types by setting the latent variables.

Data Augmentation Imitation Learning +1

Paper
Add Code

Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics

2 code implementations • 3 Oct 2019 • Johannes Ackermann, Volker Gabler, Takayuki Osa, Masashi Sugiyama

Finally, we investigate the application of multi-agent methods to high-dimensional robotic tasks and show that our approach can be used to learn decentralized policies in this domain.

Ranked #1 on Multi-agent Reinforcement Learning on ParticleEnvs Cooperative Communication

Multi-agent Reinforcement Learning Reinforcement Learning (RL)

110

Paper
Code

Hierarchical Reinforcement Learning via Advantage-Weighted Information Maximization

1 code implementation • ICLR 2019 • Takayuki Osa, Voot Tangkaratt, Masashi Sugiyama

However, identifying the hierarchical policy structure that enhances the performance of RL is not a trivial task.

Continuous Control Hierarchical Reinforcement Learning +2

Paper
Code

An Algorithmic Perspective on Imitation Learning

no code implementations • 16 Nov 2018 • Takayuki Osa, Joni Pajarinen, Gerhard Neumann, J. Andrew Bagnell, Pieter Abbeel, Jan Peters

This process of learning from demonstrations, and the study of algorithms to do so, is called imitation learning.

Imitation Learning Learning Theory

Paper
Add Code

Hierarchical Policy Search via Return-Weighted Density Estimation

no code implementations • 28 Nov 2017 • Takayuki Osa, Masashi Sugiyama

Learning an optimal policy from a multi-modal reward function is a challenging problem in reinforcement learning (RL).

Density Estimation Motion Planning +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.