no code implementations • ICML 2020 • Orlando Romero, Mouhacine Benosman
In this paper, we investigate a Lyapunov-like differential inequality that allows us to establish finite-time stability of a continuous-time state-space dynamical system represented via a multivariate ordinary differential equation or differential inclusion.
1 code implementation • 5 Dec 2024 • James Queeney, Xiaoyi Cai, Mouhacine Benosman, Jonathan P. How
The reliable deployment of deep reinforcement learning in real-world settings requires the ability to generalize across a variety of conditions, including both in-distribution scenarios seen during training as well as novel out-of-distribution scenarios.
no code implementations • 1 Mar 2024 • Xiangyuan Zhang, Saviz Mowlavi, Mouhacine Benosman, Tamer Başar
The PO step fine-tunes the model-based controller to compensate for the modeling error from dimensionality reduction.
no code implementations • 23 Feb 2024 • Xiaoyu Xie, Saviz Mowlavi, Mouhacine Benosman
Spatiotemporal modeling is critical for understanding complex systems across various scientific and engineering disciplines, but governing equations are often not fully known or computationally intractable due to inherent system complexity.
no code implementations • 19 Dec 2023 • Saviz Mowlavi, Mouhacine Benosman
Designing estimation algorithms for systems governed by partial differential equations (PDEs) such as fluid flows is challenging due to the high-dimensional and oftentimes nonlinear nature of the dynamics, as well as their dependence on unobserved physical parameters.
1 code implementation • 30 Nov 2023 • Xiangyuan Zhang, Weichao Mao, Saviz Mowlavi, Mouhacine Benosman, Tamer Başar
This project serves the learning for dynamics & control (L4DC) community, aiming to explore key questions: the convergence of RL algorithms in learning control policies; the stability and robustness issues of learning-based controllers; and the scalability of RL algorithms to high- and potentially infinite-dimensional systems.
1 code implementation • 9 Sep 2023 • Xiangyuan Zhang, Saviz Mowlavi, Mouhacine Benosman, Tamer Başar
We introduce the receding-horizon policy gradient (RHPG) algorithm, the first PG algorithm with provable global convergence in learning the optimal linear estimator designs, i. e., the Kalman filter (KF).
no code implementations • 20 Jan 2023 • Saviz Mowlavi, Mouhacine Benosman
In systems governed by nonlinear partial differential equations such as fluid flows, the design of state estimators such as Kalman filters relies on a reduced-order model (ROM) that projects the original high-dimensional dynamics onto a computationally tractable low-dimensional space.
no code implementations • 27 Apr 2022 • Weidong Cao, Mouhacine Benosman, Xuan Zhang, Rui Ma
The design automation of analog circuits is a longstanding challenge.
no code implementations • 26 Feb 2022 • Weidong Cao, Mouhacine Benosman, Xuan Zhang, Rui Ma
The design automation of analog circuits is a longstanding challenge in the integrated circuit field.
no code implementations • 29 Sep 2021 • Saviz Mowlavi, Mouhacine Benosman, Saleh Nabi
In high-dimensional nonlinear systems such as fluid flows, the design of state estimators such as Kalman filters relies on a reduced-order model (ROM) of the dynamics.
no code implementations • 5 Aug 2021 • Reazul Hasan Russel, Mouhacine Benosman, Jeroen van Baar, Radu Corcodel
Safety and robustness are two desired properties for any reinforcement learning algorithm.
no code implementations • 1 Jan 2021 • Mouhacine Benosman, Orlando Romero, Anoop Cherian
In this paper, we investigate in the context of deep neural networks, the performance of several discretization algorithms for two first-order finite-time optimization flows.
no code implementations • 17 Dec 2020 • Martin Guay, Mouhacine Benosman
The averaged Newton seeking system is shown to achieve finite-time stability of the unknown optimum of the static map.
Optimization and Control
1 code implementation • 10 Oct 2020 • Reazul Hasan Russel, Mouhacine Benosman, Jeroen van Baar
In this paper, we focus on the problem of robustifying reinforcement learning (RL) algorithms with respect to model uncertainties.
no code implementations • 6 Oct 2020 • Siqi Zhang, Mouhacine Benosman, Orlando Romero
In this study, we investigate the performance of two novel first-order optimization algorithms, namely the rescaled-gradient flow (RGF) and the signed-gradient flow (SGF).
no code implementations • 12 May 2020 • Ankush Chakrabarty, Mouhacine Benosman
Data generated from dynamical systems with unknown dynamics enable the learning of state observers that are: robust to modeling error, computationally tractable to design, and capable of operating with guaranteed performance.
no code implementations • 22 Jan 2020 • Patrik Kolaric, Devesh K. Jha, Arvind U. Raghunathan, Frank L. Lewis, Mouhacine Benosman, Diego Romeres, Daniel Nikovski
Motivated by these problems, we try to formulate the problem of trajectory optimization and local policy synthesis as a single optimization problem.
Model-based Reinforcement Learning
reinforcement-learning
+2
no code implementations • 18 Dec 2019 • Orlando Romero, Mouhacine Benosman
In this paper, we propose two discontinuous dynamical systems in continuous time with guaranteed prescribed finite-time local convergence to strict local minima of a given cost function.