Search Results for author: Sobhan Mohammadpour

Found 4 papers, 2 papers with code

Reevaluating Policy Gradient Methods for Imperfect-Information Games

2 code implementations13 Feb 2025 Max Rudolph, Nathan Lichtle, Sobhan Mohammadpour, Alexandre Bayen, J. Zico Kolter, Amy Zhang, Gabriele Farina, Eugene Vinitsky, Samuel Sokota

In the past decade, motivated by the putative failure of naive self-play deep reinforcement learning (DRL) in adversarial imperfect-information games, researchers have developed numerous DRL algorithms based on fictitious play (FP), double oracle (DO), and counterfactual regret minimization (CFR).

counterfactual Deep Reinforcement Learning +1

Decoupling regularization from the action space

1 code implementation10 Jun 2024 Sobhan Mohammadpour, Emma Frejinger, Pierre-Luc Bacon

While standard unregularized RL methods remain unaffected by changes in the number of actions, we show that it can severely impact their regularized counterparts.

Reinforcement Learning (RL)

Maximum entropy GFlowNets with soft Q-learning

no code implementations21 Dec 2023 Sobhan Mohammadpour, Emmanuel Bengio, Emma Frejinger, Pierre-Luc Bacon

Generative Flow Networks (GFNs) have emerged as a powerful tool for sampling discrete objects from unnormalized distributions, offering a scalable alternative to Markov Chain Monte Carlo (MCMC) methods.

Q-Learning Reinforcement Learning (RL)

Arc travel time and path choice model estimation subsumed

no code implementations25 Oct 2022 Sobhan Mohammadpour, Emma Frejinger

We propose a method for maximum likelihood estimation of path choice model parameters and arc travel time using data of different levels of granularity.

ARC

Cannot find the paper you are looking for? You can Submit a new open access paper.