Search Results for author: Motoya Ohnishi

Found 7 papers, 2 papers with code

Information Theoretic Regret Bounds for Online Nonlinear Control

1 code implementation • NeurIPS 2020 • Sham Kakade, Akshay Krishnamurthy, Kendall Lowrey, Motoya Ohnishi, Wen Sun

This work studies the problem of sequential control in an unknown, nonlinear dynamical system, where we model the underlying system dynamics as an unknown function in a known Reproducing Kernel Hilbert Space.

Continuous Control

Koopman Spectrum Nonlinear Regulator and Provably Efficient Online Learning

1 code implementation • 30 Jun 2021 • Motoya Ohnishi, Isao Ishikawa, Kendall Lowrey, Masahiro Ikeda, Sham Kakade, Yoshinobu Kawahara

In this work, we present a novel paradigm of controlling nonlinear systems via the minimization of the Koopman spectrum cost: a cost over the Koopman operator of the controlled dynamics.

Reinforcement Learning (RL)

Continuous-time Value Function Approximation in Reproducing Kernel Hilbert Spaces

no code implementations • NeurIPS 2018 • Motoya Ohnishi, Masahiro Yukawa, Mikael Johansson, Masashi Sugiyama

Motivated by the success of reinforcement learning (RL) for discrete-time tasks such as AlphaGo and Atari games, there has been a recent surge of interest in using RL for continuous-time control of physical systems (cf.

Atari Games • Gaussian Processes • +2

Barrier-Certified Adaptive Reinforcement Learning with Applications to Brushbot Navigation

no code implementations • 29 Jan 2018 • Motoya Ohnishi, Li Wang, Gennaro Notomista, Magnus Egerstedt

This paper presents a safe learning framework that employs an adaptive model learning algorithm together with barrier certificates for systems with possibly nonstationary agent dynamics.

Reinforcement Learning (RL)

Constraint Learning for Control Tasks with Limited Duration Barrier Functions

no code implementations • 26 Aug 2019 • Motoya Ohnishi, Gennaro Notomista, Masashi Sugiyama, Magnus Egerstedt

When deploying autonomous agents in unstructured environments over sustained periods of time, adaptability and robustness oftentimes outweigh optimality as a primary consideration.

Dynamic Structure Estimation from Bandit Feedback

no code implementations • 2 Jun 2022 • Motoya Ohnishi, Isao Ishikawa, Yuko Kuroki, Masahiro Ikeda

This work presents a novel method for structure estimation of an underlying dynamical system.

Signatures Meet Dynamic Programming: Generalizing Bellman Equations for Trajectory Following

no code implementations • 9 Dec 2023 • Motoya Ohnishi, Iretiayo Akinola, Jie Xu, Ajay Mandlekar, Fabio Ramos

As a specific case of our framework, we devise a model predictive control method for path tracking.

Model Predictive Control • Time Series • +1
