Search Results for author: Mohamad Kazem Shirani Faradonbeh

Found 16 papers, 0 papers with code

Thompson Sampling in Partially Observable Contextual Bandits

no code implementations15 Feb 2024 Hongju Park, Mohamad Kazem Shirani Faradonbeh

Accordingly, a fundamental problem is that of balancing exploration (i. e., pulling different arms to learn their parameters), versus exploitation (i. e., pulling the best arms to gain reward).

Decision Making Decision Making Under Uncertainty +2

Thompson Sampling Efficiently Learns to Control Diffusion Processes

no code implementations20 Jun 2022 Mohamad Kazem Shirani Faradonbeh, Mohamad Sadegh Shirani Faradonbeh, Mohsen Bayati

To the best of our knowledge, this is the first such result for Thompson sampling in a diffusion process control problem.

Decision Making Thompson Sampling

Regret Analysis of Certainty Equivalence Policies in Continuous-Time Linear-Quadratic Systems

no code implementations9 Jun 2022 Mohamad Kazem Shirani Faradonbeh

This work theoretically studies a ubiquitous reinforcement learning policy for controlling the canonical model of continuous-time stochastic linear-quadratic systems.

reinforcement-learning Reinforcement Learning (RL)

Worst-case Performance of Greedy Policies in Bandits with Imperfect Context Observations

no code implementations10 Apr 2022 Hongju Park, Mohamad Kazem Shirani Faradonbeh

Contextual bandits are canonical models for sequential decision-making under uncertainty in environments with time-varying components.

Decision Making Decision Making Under Uncertainty +1

Efficient Algorithms for Learning to Control Bandits with Unobserved Contexts

no code implementations2 Feb 2022 Hongju Park, Mohamad Kazem Shirani Faradonbeh

Contextual bandits are widely-used in the study of learning-based control policies for finite action spaces.

Multi-Armed Bandits

Bayesian Algorithms Learn to Stabilize Unknown Continuous-Time Systems

no code implementations30 Dec 2021 Mohamad Kazem Shirani Faradonbeh, Mohamad Sadegh Shirani Faradonbeh

Linear dynamical systems are canonical models for learning-based control of plants with uncertain dynamics.

Joint Learning of Linear Time-Invariant Dynamical Systems

no code implementations21 Dec 2021 Aditya Modi, Mohamad Kazem Shirani Faradonbeh, Ambuj Tewari, George Michailidis

Linear time-invariant systems are very popular models in system theory and applications.

Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits

no code implementations23 Oct 2021 Hongju Park, Mohamad Kazem Shirani Faradonbeh

We propose a Thompson Sampling algorithm for partially observable contextual multi-armed bandits, and establish theoretical performance guarantees.

Decision Making Multi-Armed Bandits +3

Online Distributed Estimation of Principal Eigenspaces

no code implementations17 May 2019 Davoud Ataee Tarzanagh, Mohamad Kazem Shirani Faradonbeh, George Michailidis

Principal components analysis (PCA) is a widely used dimension reduction technique with an extensive range of applications.

Clustering Dimensionality Reduction

Randomized Algorithms for Data-Driven Stabilization of Stochastic Linear Systems

no code implementations16 May 2019 Mohamad Kazem Shirani Faradonbeh, Ambuj Tewari, George Michailidis

We provide numerical analyses for the performance of two methods: stochastic feedback, and stochastic parameter.

Input Perturbations for Adaptive Control and Learning

no code implementations10 Nov 2018 Mohamad Kazem Shirani Faradonbeh, Ambuj Tewari, George Michailidis

This paper studies adaptive algorithms for simultaneous regulation (i. e., control) and estimation (i. e., learning) of Multiple Input Multiple Output (MIMO) linear dynamical systems.

Finite Time Adaptive Stabilization of LQ Systems

no code implementations22 Jul 2018 Mohamad Kazem Shirani Faradonbeh, Ambuj Tewari, George Michailidis

There are only a few existing non-asymptotic results and a full treatment of the problem is not currently available.

Optimism-Based Adaptive Regulation of Linear-Quadratic Systems

no code implementations20 Nov 2017 Mohamad Kazem Shirani Faradonbeh, Ambuj Tewari, George Michailidis

The main challenge for adaptive regulation of linear-quadratic systems is the trade-off between identification and control.

Cannot find the paper you are looking for? You can Submit a new open access paper.