2 code implementations • 13 Feb 2025 • Max Rudolph, Nathan Lichtle, Sobhan Mohammadpour, Alexandre Bayen, J. Zico Kolter, Amy Zhang, Gabriele Farina, Eugene Vinitsky, Samuel Sokota
In the past decade, motivated by the putative failure of naive self-play deep reinforcement learning (DRL) in adversarial imperfect-information games, researchers have developed numerous DRL algorithms based on fictitious play (FP), double oracle (DO), and counterfactual regret minimization (CFR).
1 code implementation • 10 Jun 2024 • Sobhan Mohammadpour, Emma Frejinger, Pierre-Luc Bacon
While standard unregularized RL methods remain unaffected by changes in the number of actions, we show that it can severely impact their regularized counterparts.
no code implementations • 21 Dec 2023 • Sobhan Mohammadpour, Emmanuel Bengio, Emma Frejinger, Pierre-Luc Bacon
Generative Flow Networks (GFNs) have emerged as a powerful tool for sampling discrete objects from unnormalized distributions, offering a scalable alternative to Markov Chain Monte Carlo (MCMC) methods.
no code implementations • 25 Oct 2022 • Sobhan Mohammadpour, Emma Frejinger
We propose a method for maximum likelihood estimation of path choice model parameters and arc travel time using data of different levels of granularity.