Search Results for author: Sebastien Gros

Found 24 papers, 1 papers with code

Probabilistic reachable sets of stochastic nonlinear systems with contextual uncertainties

no code implementations19 Mar 2024 Xun Shen, Ye Wang, Kazumune Hashimoto, Yuhu Wu, Sebastien Gros

The existing methods of computing probabilistic reachable sets normally assume that the uncertainties are independent of the state.

Density Estimation

Once upon a time step: A closed-loop approach to robust MPC design

no code implementations20 Mar 2023 Anilkumar Parsi, Marcell Bartos, Amber Srivastava, Sebastien Gros, Roy S. Smith

A novel perspective on the design of robust model predictive control (MPC) methods is presented, whereby closed-loop constraint satisfaction is ensured using recursive feasibility of the MPC optimization.

LEMMA Model Predictive Control

Deep active learning for nonlinear system identification

no code implementations24 Feb 2023 Erlend Torje Berg Lundby, Adil Rasheed, Ivar Johan Halvorsen, Dirk Reinhardt, Sebastien Gros, Jan Tommy Gravdahl

This simulated dataset can be used in a static deep active learning acquisition scheme referred to as global explorations.

Active Learning

Equivalence of Optimality Criteria for Markov Decision Process and Model Predictive Control

no code implementations9 Oct 2022 Arash Bahari Kordabad, Mario Zanon, Sebastien Gros

This paper shows that the optimal policy and value functions of a Markov Decision Process (MDP), either discounted or not, can be captured by a finite-horizon undiscounted Optimal Control Problem (OCP), even if based on an inexact model.

Model Predictive Control reinforcement-learning +1

Policy Gradient Reinforcement Learning for Uncertain Polytopic LPV Systems based on MHE-MPC

no code implementations10 Jun 2022 Hossein Nejatbakhsh Esfahani, Sebastien Gros

In this paper, we propose a learning-based Model Predictive Control (MPC) approach for the polytopic Linear Parameter-Varying (LPV) systems with inexact scheduling parameters (as exogenous signals with inexact bounds), where the Linear Time Invariant (LTI) models (vertices) captured by combinations of the scheduling parameters becomes wrong.

Model Predictive Control reinforcement-learning +2

Bridging the gap between QP-based and MPC-based RL

no code implementations18 May 2022 Shambhuraj Sawant, Sebastien Gros

We propose simple tools to promote structures in the QP, pushing it to resemble a linear MPC scheme.

Interpretable Battery Cycle Life Range Prediction Using Early Degradation Data at Cell Level

no code implementations26 Apr 2022 Huang Zhang, Yang Su, Faisal Altaf, Torsten Wik, Sebastien Gros

For that reason, various data-driven methods have been proposed for point prediction of battery cycle life with minimum knowledge of the battery degradation mechanisms.

Decision Making Management +1

Functional Stability of Discounted Markov Decision Processes Using Economic MPC Dissipativity Theory

no code implementations31 Mar 2022 Arash Bahari Kordabad, Sebastien Gros

This paper discusses the functional stability of closed-loop Markov Chains under optimal policies resulting from a discounted optimality criterion, forming Markov Decision Processes (MDPs).

Model Predictive Control Q-Learning +1

Quasi-Newton Iteration in Deterministic Policy Gradient

no code implementations25 Mar 2022 Arash Bahari Kordabad, Hossein Nejatbakhsh Esfahani, WenQi Cai, Sebastien Gros

We show that the approximate Hessian converges to the exact Hessian at the optimal policy, and allows for a superlinear convergence in the learning, provided that the policy parametrization is rich.

reinforcement-learning Reinforcement Learning (RL)

A Semi-Distributed Interior Point Algorithm for Optimal Coordination of Automated Vehicles at Intersections

no code implementations19 Nov 2021 Robert Hult, Mario Zanon, Sebastien Gros, Paolo Falcone

In this paper, we consider the optimal coordination of automated vehicles at intersections under fixed crossing orders.

Distributed Optimization

Backstepping-based Integral Sliding Mode Control with Time Delay Estimation for Autonomous Underwater Vehicles

no code implementations19 Nov 2021 Hossein Nejatbakhsh Esfahani, Behdad Aminian, Esten Ingar Grøtli, Sebastien Gros

The aim of this paper is to propose a high performance control approach for trajectory tracking of Autonomous Underwater Vehicles (AUVs).

Optimization of the Model Predictive Control Meta-Parameters Through Reinforcement Learning

no code implementations7 Nov 2021 Eivind Bøhn, Sebastien Gros, Signe Moe, Tor Arne Johansen

Its high computational complexity results in high power consumption from the control algorithm, which could account for a significant share of the energy resources in battery-powered embedded systems.

Model Predictive Control reinforcement-learning +1

Verification of Dissipativity and Evaluation of Storage Function in Economic Nonlinear MPC using Q-Learning

no code implementations24 May 2021 Arash Bahari Kordabad, Sebastien Gros

In the Economic Nonlinear Model Predictive (ENMPC) context, closed-loop stability relates to the existence of a storage function satisfying a dissipation inequality.

Q-Learning Reinforcement Learning (RL)

Approximate Robust NMPC using Reinforcement Learning

no code implementations6 Apr 2021 Hossein Nejatbakhsh Esfahani, Arash Bahari Kordabad, Sebastien Gros

We present a Reinforcement Learning-based Robust Nonlinear Model Predictive Control (RL-RNMPC) framework for controlling nonlinear systems in the presence of disturbances and uncertainties.

Model Predictive Control reinforcement-learning +1

Bias Correction in Deterministic Policy Gradient Using Robust MPC

no code implementations6 Apr 2021 Arash Bahari Kordabad, Hossein Nejatbakhsh Esfahani, Sebastien Gros

In this paper, we discuss the deterministic policy gradient using the Actor-Critic methods based on the linear compatible advantage function approximator, where the input spaces are continuous.

Model Predictive Control

MPC-based Reinforcement Learning for Economic Problems with Application to Battery Storage

no code implementations6 Apr 2021 Arash Bahari Kordabad, WenQi Cai, Sebastien Gros

In this paper, we are interested in optimal control problems with purely economic costs, which often yield optimal policies having a (nearly) bang-bang structure.

Model Predictive Control reinforcement-learning +1

Reinforcement Learning based on MPC/MHE for Unmodeled and Partially Observable Dynamics

no code implementations22 Mar 2021 Hossein Nejatbakhsh Esfahani, Arash Bahari Kordabad, Sebastien Gros

This paper proposes an observer-based framework for solving Partially Observable Markov Decision Processes (POMDPs) when an accurate model is not available.

Model Predictive Control reinforcement-learning +1

Reinforcement Learning of the Prediction Horizon in Model Predictive Control

no code implementations22 Feb 2021 Eivind Bøhn, Sebastien Gros, Signe Moe, Tor Arne Johansen

Model predictive control (MPC) is a powerful trajectory optimization control technique capable of controlling complex nonlinear systems while respecting system constraints and ensuring safe operation.

Model Predictive Control reinforcement-learning +1

Optimization of the Model Predictive Control Update Interval Using Reinforcement Learning

1 code implementation26 Nov 2020 Eivind Bøhn, Sebastien Gros, Signe Moe, Tor Arne Johansen

In control applications there is often a compromise that needs to be made with regards to the complexity and performance of the controller and the computational resources that are available.

Model Predictive Control reinforcement-learning +1

Reinforcement Learning for Mixed-Integer Problems Based on MPC

no code implementations3 Apr 2020 Sebastien Gros, Mario Zanon

Model Predictive Control has been recently proposed as policy approximation for Reinforcement Learning, offering a path towards safe and explainable Reinforcement Learning.

Model Predictive Control Q-Learning +2

Safe Reinforcement Learning via Projection on a Safe Set: How to Achieve Optimality?

no code implementations2 Apr 2020 Sebastien Gros, Mario Zanon, Alberto Bemporad

For all its successes, Reinforcement Learning (RL) still struggles to deliver formal guarantees on the closed-loop behavior of the learned policy.

Policy Gradient Methods Q-Learning +3

Cannot find the paper you are looking for? You can Submit a new open access paper.