Search Results for author: Ioannis Anagnostides

Found 8 papers, 0 papers with code

Optimistic Policy Gradient in Multi-Player Markov Games with a Single Controller: Convergence Beyond the Minty Property

no code implementations • 19 Dec 2023 • Ioannis Anagnostides, Ioannis Panageas, Gabriele Farina, Tuomas Sandholm

Policy gradient methods enjoy strong practical performance in numerous tasks in reinforcement learning.

Paper
Add Code

Near-Optimal $Φ$-Regret Learning in Extensive-Form Games

no code implementations • 20 Aug 2022 • Ioannis Anagnostides, Gabriele Farina, Tuomas Sandholm

In this paper, we establish efficient and uncoupled learning dynamics so that, when employed by all players in multiplayer perfect-recall imperfect-information extensive-form games, the trigger regret of each player grows as $O(\log T)$ after $T$ repetitions of play.

Open-Ended Question Answering

Paper
Add Code

Efficiently Computing Nash Equilibria in Adversarial Team Markov Games

no code implementations • 3 Aug 2022 • Fivos Kalogiannis, Ioannis Anagnostides, Ioannis Panageas, Emmanouil-Vasileios Vlatakis-Gkaragkounis, Vaggos Chatziafratis, Stelios Stavroulakis

In this work, we depart from those prior results by investigating infinite-horizon \emph{adversarial team Markov games}, a natural and well-motivated class of games in which a team of identically-interested players -- in the absence of any explicit coordination or communication -- is competing against an adversarial player.

Multi-agent Reinforcement Learning

Paper
Add Code

Near-Optimal No-Regret Learning Dynamics for General Convex Games

no code implementations • 17 Jun 2022 • Gabriele Farina, Ioannis Anagnostides, Haipeng Luo, Chung-Wei Lee, Christian Kroer, Tuomas Sandholm

In this paper, we answer this in the positive by establishing the first uncoupled learning algorithm with $O(\log T)$ per-player regret in general \emph{convex games}, that is, games with concave utility functions supported on arbitrary convex and compact strategy sets.

Paper
Add Code

Uncoupled Learning Dynamics with $O(\log T)$ Swap Regret in Multiplayer Games

no code implementations • 25 Apr 2022 • Ioannis Anagnostides, Gabriele Farina, Christian Kroer, Chung-Wei Lee, Haipeng Luo, Tuomas Sandholm

In this paper we establish efficient and \emph{uncoupled} learning dynamics so that, when employed by all players in a general-sum multiplayer game, the \emph{swap regret} of each player after $T$ repetitions of the game is bounded by $O(\log T)$, improving over the prior best bounds of $O(\log^4 (T))$.

Paper
Add Code

Near-Optimal No-Regret Learning for Correlated Equilibria in Multi-Player General-Sum Games

no code implementations • 11 Nov 2021 • Ioannis Anagnostides, Constantinos Daskalakis, Gabriele Farina, Maxwell Fishelson, Noah Golowich, Tuomas Sandholm

Recently, Daskalakis, Fishelson, and Golowich (DFG) (NeurIPS`21) showed that if all agents in a multi-player general-sum normal-form game employ Optimistic Multiplicative Weights Update (OMWU), the external regret of every player is $O(\textrm{polylog}(T))$ after $T$ repetitions of the game.

Paper
Add Code

Faster No-Regret Learning Dynamics for Extensive-Form Correlated Equilibrium

no code implementations • 29 Sep 2021 • Ioannis Anagnostides, Gabriele Farina, Christian Kroer, Tuomas Sandholm

A recent emerging trend in the literature on learning in games has been concerned with providing accelerated learning dynamics for correlated and coarse correlated equilibria in normal-form games.

Paper
Add Code

Robust Learning under Strong Noise via SQs

no code implementations • 18 Oct 2020 • Ioannis Anagnostides, Themis Gouleakis, Ali Marashian

This work provides several new insights on the robustness of Kearns' statistical query framework against challenging label-noise models.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.