1 code implementation • NeurIPS 2023 • Daiki E. Matsunaga, Jongmin Lee, Jaeseok Yoon, Stefanos Leonardos, Pieter Abbeel, Kee-Eung Kim
To this end, we introduce AlberDICE, an offline MARL algorithm that alternatively performs centralized training of individual agents based on stationary distribution optimization.
no code implementations • NeurIPS 2021 • Stefanos Leonardos, Georgios Piliouras, Kelly Spendlove
The interplay between exploration and exploitation in competitive multi-agent learning is still far from being well understood.
1 code implementation • NeurIPS 2021 • Stefanos Leonardos, Will Overman, Ioannis Panageas, Georgios Piliouras
Counter-intuitively, insights from normal-form potential games do not carry over as MPGs can consist of settings where state-games can be zero-sum games.
no code implementations • 5 Dec 2020 • Stefanos Leonardos, Georgios Piliouras
Exploration-exploitation is a powerful and practical tool in multi-agent learning (MAL), however, its effects are far from understood.
Q-Learning Computer Science and Game Theory Multiagent Systems Dynamical Systems 93A16, 91A26, 91A68, 58K35 G.3; J.4; F.2.2
no code implementations • 24 Sep 2019 • Constandina Koki, Stefanos Leonardos, Georgios Piliouras
Conventional financial models fail to explain the economic and monetary properties of cryptocurrencies due to the latter's dual nature: their usage as financial assets on the one side and their tight connection to the underlying blockchain structure on the other.
no code implementations • 9 Jun 2019 • Stefanos Leonardos, Costis Melolidakis
We revisit the linear Cournot model with uncertain demand that is studied in Lagerl\"of (2006)* and provide sufficient conditions for equilibrium uniqueness that complement the existing results.