no code implementations • 15 Feb 2024 • Hongju Park, Mohamad Kazem Shirani Faradonbeh
Accordingly, a fundamental problem is that of balancing exploration (i.e., pulling different arms to learn their parameters) versus exploitation (i.e., pulling the best arms to gain reward).
no code implementations • 20 Jun 2022 • Mohamad Kazem Shirani Faradonbeh, Mohamad Sadegh Shirani Faradonbeh, Mohsen Bayati
To the best of our knowledge, this is the first such result for Thompson sampling in a diffusion process control problem.
no code implementations • 9 Jun 2022 • Mohamad Kazem Shirani Faradonbeh
This work theoretically studies a ubiquitous reinforcement learning policy for controlling the canonical model of continuous-time stochastic linear-quadratic systems.
no code implementations • 10 Apr 2022 • Hongju Park, Mohamad Kazem Shirani Faradonbeh
Contextual bandits are canonical models for sequential decision-making under uncertainty in environments with time-varying components.
no code implementations • 2 Feb 2022 • Hongju Park, Mohamad Kazem Shirani Faradonbeh
Contextual bandits are widely used in the study of learning-based control policies for finite action spaces.
no code implementations • 1 Jan 2022 • Mohamad Kazem Shirani Faradonbeh, Aditya Modi
Learning-based control of linear systems has received a lot of attention recently.
no code implementations • 30 Dec 2021 • Mohamad Kazem Shirani Faradonbeh, Mohamad Sadegh Shirani Faradonbeh
Linear dynamical systems are canonical models for learning-based control of plants with uncertain dynamics.
no code implementations • 21 Dec 2021 • Aditya Modi, Mohamad Kazem Shirani Faradonbeh, Ambuj Tewari, George Michailidis
Linear time-invariant systems are very popular models in system theory and applications.
no code implementations • 23 Oct 2021 • Hongju Park, Mohamad Kazem Shirani Faradonbeh
We propose a Thompson Sampling algorithm for partially observable contextual multi-armed bandits, and establish theoretical performance guarantees.
no code implementations • 16 Sep 2021 • Mohamad Kazem Shirani Faradonbeh, Mohamad Sadegh Shirani Faradonbeh
Linear dynamical systems that obey stochastic differential equations are canonical models.
no code implementations • 17 May 2019 • Davoud Ataee Tarzanagh, Mohamad Kazem Shirani Faradonbeh, George Michailidis
Principal components analysis (PCA) is a widely used dimension reduction technique with an extensive range of applications.
no code implementations • 16 May 2019 • Mohamad Kazem Shirani Faradonbeh, Ambuj Tewari, George Michailidis
We provide numerical analyses for the performance of two methods: stochastic feedback, and stochastic parameter.
no code implementations • 14 Mar 2019 • Mohamad Kazem Shirani Faradonbeh, Ambuj Tewari, George Michailidis
In decision making problems for continuous state and action spaces, linear dynamical models are widely employed.
no code implementations • 10 Nov 2018 • Mohamad Kazem Shirani Faradonbeh, Ambuj Tewari, George Michailidis
This paper studies adaptive algorithms for simultaneous regulation (i.e., control) and estimation (i.e., learning) of Multiple Input Multiple Output (MIMO) linear dynamical systems.
no code implementations • 22 Jul 2018 • Mohamad Kazem Shirani Faradonbeh, Ambuj Tewari, George Michailidis
There are only a few existing non-asymptotic results and a full treatment of the problem is not currently available.
no code implementations • 20 Nov 2017 • Mohamad Kazem Shirani Faradonbeh, Ambuj Tewari, George Michailidis
The main challenge for adaptive regulation of linear-quadratic systems is the trade-off between identification and control.