Search Results for author: Mohamad Kazem Shirani Faradonbeh

Found 16 papers, 0 papers with code

Thompson Sampling in Partially Observable Contextual Bandits

no code implementations • 15 Feb 2024 • Hongju Park, Mohamad Kazem Shirani Faradonbeh

Accordingly, a fundamental problem is that of balancing exploration (i. e., pulling different arms to learn their parameters), versus exploitation (i. e., pulling the best arms to gain reward).

Decision Making Decision Making Under Uncertainty +2

Paper
Add Code

Thompson Sampling Efficiently Learns to Control Diffusion Processes

no code implementations • 20 Jun 2022 • Mohamad Kazem Shirani Faradonbeh, Mohamad Sadegh Shirani Faradonbeh, Mohsen Bayati

To the best of our knowledge, this is the first such result for Thompson sampling in a diffusion process control problem.

Decision Making Thompson Sampling

Paper
Add Code

Regret Analysis of Certainty Equivalence Policies in Continuous-Time Linear-Quadratic Systems

no code implementations • 9 Jun 2022 • Mohamad Kazem Shirani Faradonbeh

This work theoretically studies a ubiquitous reinforcement learning policy for controlling the canonical model of continuous-time stochastic linear-quadratic systems.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Worst-case Performance of Greedy Policies in Bandits with Imperfect Context Observations

no code implementations • 10 Apr 2022 • Hongju Park, Mohamad Kazem Shirani Faradonbeh

Contextual bandits are canonical models for sequential decision-making under uncertainty in environments with time-varying components.

Decision Making Decision Making Under Uncertainty +1

Paper
Add Code

Efficient Algorithms for Learning to Control Bandits with Unobserved Contexts

no code implementations • 2 Feb 2022 • Hongju Park, Mohamad Kazem Shirani Faradonbeh

Contextual bandits are widely-used in the study of learning-based control policies for finite action spaces.

Multi-Armed Bandits

Paper
Add Code

Joint Learning-Based Stabilization of Multiple Unknown Linear Systems

no code implementations • 1 Jan 2022 • Mohamad Kazem Shirani Faradonbeh, Aditya Modi

Learning-based control of linear systems received a lot of attentions recently.

Reinforcement Learning (RL)

Paper
Add Code

Bayesian Algorithms Learn to Stabilize Unknown Continuous-Time Systems

no code implementations • 30 Dec 2021 • Mohamad Kazem Shirani Faradonbeh, Mohamad Sadegh Shirani Faradonbeh

Linear dynamical systems are canonical models for learning-based control of plants with uncertain dynamics.

Paper
Add Code

Joint Learning of Linear Time-Invariant Dynamical Systems

no code implementations • 21 Dec 2021 • Aditya Modi, Mohamad Kazem Shirani Faradonbeh, Ambuj Tewari, George Michailidis

Linear time-invariant systems are very popular models in system theory and applications.

Paper
Add Code

Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits

no code implementations • 23 Oct 2021 • Hongju Park, Mohamad Kazem Shirani Faradonbeh

We propose a Thompson Sampling algorithm for partially observable contextual multi-armed bandits, and establish theoretical performance guarantees.

Decision Making Multi-Armed Bandits +3

Paper
Add Code

Reinforcement Learning Policies in Continuous-Time Linear Systems

no code implementations • 16 Sep 2021 • Mohamad Kazem Shirani Faradonbeh, Mohamad Sadegh Shirani Faradonbeh

Linear dynamical systems that obey stochastic differential equations are canonical models.

Decision Making reinforcement-learning

Paper
Add Code

Online Distributed Estimation of Principal Eigenspaces

no code implementations • 17 May 2019 • Davoud Ataee Tarzanagh, Mohamad Kazem Shirani Faradonbeh, George Michailidis

Principal components analysis (PCA) is a widely used dimension reduction technique with an extensive range of applications.

Clustering Dimensionality Reduction

Paper
Add Code

Randomized Algorithms for Data-Driven Stabilization of Stochastic Linear Systems

no code implementations • 16 May 2019 • Mohamad Kazem Shirani Faradonbeh, Ambuj Tewari, George Michailidis

We provide numerical analyses for the performance of two methods: stochastic feedback, and stochastic parameter.

Paper
Add Code

On Applications of Bootstrap in Continuous Space Reinforcement Learning

no code implementations • 14 Mar 2019 • Mohamad Kazem Shirani Faradonbeh, Ambuj Tewari, George Michailidis

In decision making problems for continuous state and action spaces, linear dynamical models are widely employed.

Decision Making reinforcement-learning +1

Paper
Add Code

Input Perturbations for Adaptive Control and Learning

no code implementations • 10 Nov 2018 • Mohamad Kazem Shirani Faradonbeh, Ambuj Tewari, George Michailidis

This paper studies adaptive algorithms for simultaneous regulation (i. e., control) and estimation (i. e., learning) of Multiple Input Multiple Output (MIMO) linear dynamical systems.

Paper
Add Code

Finite Time Adaptive Stabilization of LQ Systems

no code implementations • 22 Jul 2018 • Mohamad Kazem Shirani Faradonbeh, Ambuj Tewari, George Michailidis

There are only a few existing non-asymptotic results and a full treatment of the problem is not currently available.

Paper
Add Code

Optimism-Based Adaptive Regulation of Linear-Quadratic Systems

no code implementations • 20 Nov 2017 • Mohamad Kazem Shirani Faradonbeh, Ambuj Tewari, George Michailidis

The main challenge for adaptive regulation of linear-quadratic systems is the trade-off between identification and control.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.