Search Results for author: Mustafa Mert Çelikok

Found 7 papers, 3 papers with code

Towards a Unifying Model of Rationality in Multiagent Systems

no code implementations29 May 2023 Robert Loftin, Mustafa Mert Çelikok, Frans A. Oliehoek

Multiagent systems deployed in the real world need to cooperate with other agents (including humans) nearly as effectively as these agents cooperate with one another.

Uncoupled Learning of Differential Stackelberg Equilibria with Commitments

no code implementations7 Feb 2023 Robert Loftin, Mustafa Mert Çelikok, Herke van Hoof, Samuel Kaski, Frans A. Oliehoek

A natural solution concept for many multiagent settings is the Stackelberg equilibrium, under which a ``leader'' agent selects a strategy that maximizes its own payoff assuming the ``follower'' chooses their best response to this strategy.

Multi-agent Reinforcement Learning

Differentiable User Models

1 code implementation29 Nov 2022 Alex Hämäläinen, Mustafa Mert Çelikok, Samuel Kaski

Probabilistic user modeling is essential for building machine learning systems in the ubiquitous cases with humans in the loop.

Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems

1 code implementation1 Jul 2022 Miguel Suau, Jinke He, Mustafa Mert Çelikok, Matthijs T. J. Spaan, Frans A. Oliehoek

Due to its high sample complexity, simulation is, as of today, critical for the successful application of reinforcement learning.

Interactive AI with a Theory of Mind

no code implementations1 Dec 2019 Mustafa Mert Çelikok, Tomi Peltola, Pedram Daee, Samuel Kaski

Understanding each other is the key to success in collaboration.

Machine Teaching of Active Sequential Learners

1 code implementation NeurIPS 2019 Tomi Peltola, Mustafa Mert Çelikok, Pedram Daee, Samuel Kaski

We formulate this sequential teaching problem, which current techniques in machine teaching do not address, as a Markov decision process, with the dynamics nesting a model of the learner and the actions being the teacher's responses.

Multi-Armed Bandits Probabilistic Programming

Cannot find the paper you are looking for? You can Submit a new open access paper.