Search Results for author: Ahmadreza Moradipari

Found 10 papers, 1 papers with code

Convex Methods for Constrained Linear Bandits

no code implementations • 7 Nov 2023 • Amirhossein Afsharrad, Ahmadreza Moradipari, Sanjay Lall

Recently, bandit optimization has received significant attention in real-world safety-critical systems that involve repeated interactions with humans.

Paper
Add Code

Improved Bayesian Regret Bounds for Thompson Sampling in Reinforcement Learning

no code implementations • NeurIPS 2023 • Ahmadreza Moradipari, Mohammad Pedramfar, Modjtaba Shokrian Zini, Vaneet Aggarwal

In this paper, we prove the first Bayesian regret bounds for Thompson Sampling in reinforcement learning in a multitude of settings.

reinforcement-learning Thompson Sampling

Paper
Add Code

Controlling the Latent Space of GANs through Reinforcement Learning: A Case Study on Task-based Image-to-Image Translation

no code implementations • 26 Jul 2023 • Mahyar Abbasian, Taha Rajabzadeh, Ahmadreza Moradipari, Seyed Amir Hossein Aqajari, HongSheng Lu, Amir Rahmani

Generative Adversarial Networks (GAN) have emerged as a formidable AI tool to generate realistic outputs based on training datasets.

Image-to-Image Translation Reinforcement Learning (RL)

Paper
Add Code

Collaborative Multi-agent Stochastic Linear Bandits

no code implementations • 12 May 2022 • Ahmadreza Moradipari, Mohammad Ghavamzadeh, Mahnoosh Alizadeh

We propose a distributed upper confidence bound (UCB) algorithm and prove a high probability bound on its $T$-round regret in which we include a linear growth of regret associated with each communication round.

Paper
Add Code

Multi-Environment Meta-Learning in Stochastic Linear Bandits

no code implementations • 12 May 2022 • Ahmadreza Moradipari, Mohammad Ghavamzadeh, Taha Rajabzadeh, Christos Thrampoulidis, Mahnoosh Alizadeh

In this work we investigate meta-learning (or learning-to-learn) approaches in multi-task linear stochastic bandit problems that can originate from multiple environments.

Meta-Learning

Paper
Add Code

Feature and Parameter Selection in Stochastic Linear Bandits

no code implementations • 9 Jun 2021 • Ahmadreza Moradipari, Berkay Turan, Yasin Abbasi-Yadkori, Mahnoosh Alizadeh, Mohammad Ghavamzadeh

In the second setting, the reward parameter of the LB problem is arbitrarily selected from $M$ models represented as (possibly) overlapping balls in $\mathbb R^d$.

feature selection Model Selection

Paper
Add Code

Stage-wise Conservative Linear Bandits

no code implementations • NeurIPS 2020 • Ahmadreza Moradipari, Christos Thrampoulidis, Mahnoosh Alizadeh

For this problem, we present two novel algorithms, stage-wise conservative linear Thompson Sampling (SCLTS) and stage-wise conservative linear UCB (SCLUCB), that respect the baseline constraints and enjoy probabilistic regret bounds of order O(\sqrt{T} \log^{3/2}T) and O(\sqrt{T} \log T), respectively.

Thompson Sampling

Paper
Add Code

Coagent Networks Revisited

1 code implementation • 28 Jan 2020 • Modjtaba Shokrian Zini, Mohammad Pedramfar, Matthew Riemer, Ahmadreza Moradipari, Miao Liu

Coagent networks formalize the concept of arbitrary networks of stochastic agents that collaborate to take actions in a reinforcement learning environment.

Hierarchical Reinforcement Learning reinforcement-learning

Paper
Code

Safe Linear Thompson Sampling with Side Information

no code implementations • 6 Nov 2019 • Ahmadreza Moradipari, Sanae Amani, Mahnoosh Alizadeh, Christos Thrampoulidis

We compare the performance of our algorithm with UCB-based safe algorithms and highlight how the inherently randomized nature of TS leads to a superior performance in expanding the set of safe actions the algorithm has access to at each round.

Thompson Sampling

Paper
Add Code

Using Empirical Covariance Matrix in Enhancing Prediction Accuracy of Linear Models with Missing Information

no code implementations • 21 Nov 2016 • Ahmadreza Moradipari, Sina Shahsavari, Ashkan Esmaeili, Farokh Marvasti

When sparse models are also suffering from MI, the sparse recovery and inference of the missing models are taken into account simultaneously.

feature selection Learning Theory +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.