Search Results for author: Modjtaba Shokrian Zini

Found 2 papers, 1 papers with code

Improved Bayesian Regret Bounds for Thompson Sampling in Reinforcement Learning

no code implementations • NeurIPS 2023 • Ahmadreza Moradipari, Mohammad Pedramfar, Modjtaba Shokrian Zini, Vaneet Aggarwal

In this paper, we prove the first Bayesian regret bounds for Thompson Sampling in reinforcement learning in a multitude of settings.

reinforcement-learning Thompson Sampling

Paper
Add Code

Coagent Networks Revisited

1 code implementation • 28 Jan 2020 • Modjtaba Shokrian Zini, Mohammad Pedramfar, Matthew Riemer, Ahmadreza Moradipari, Miao Liu

Coagent networks formalize the concept of arbitrary networks of stochastic agents that collaborate to take actions in a reinforcement learning environment.

Hierarchical Reinforcement Learning reinforcement-learning

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.