Search Results for author: Batuhan Yardim

Found 3 papers, 0 papers with code

On the Statistical Efficiency of Mean Field Reinforcement Learning with General Function Approximation

no code implementations18 May 2023 Jiawei Huang, Batuhan Yardim, Niao He

In this paper, we study the fundamental statistical efficiency of Reinforcement Learning in Mean-Field Control (MFC) and Mean-Field Game (MFG) with general model-based function approximation.

Policy Mirror Ascent for Efficient and Independent Learning in Mean Field Games

no code implementations29 Dec 2022 Batuhan Yardim, Semih Cayci, Matthieu Geist, Niao He

Instead, we show that $N$ agents running policy mirror ascent converge to the Nash equilibrium of the regularized game within $\widetilde{\mathcal{O}}(\varepsilon^{-2})$ samples from a single sample trajectory without a population generative model, up to a standard $\mathcal{O}(\frac{1}{\sqrt{N}})$ error due to the mean field.

Trust Region Policy Optimization with Optimal Transport Discrepancies: Duality and Algorithm for Continuous Actions

no code implementations20 Oct 2022 Antonio Terpin, Nicolas Lanzetti, Batuhan Yardim, Florian Dörfler, Giorgia Ramponi

In this paper, we explore optimal transport discrepancies (which include the Wasserstein distance) to define trust regions, and we propose a novel algorithm - Optimal Transport Trust Region Policy Optimization (OT-TRPO) - for continuous state-action spaces.

Continuous Control

Cannot find the paper you are looking for? You can Submit a new open access paper.