Search Results for author: Michael Kaisers

Found 7 papers, 1 papers with code

BRExIt: On Opponent Modelling in Expert Iteration

no code implementations31 May 2022 Daniel Hernandez, Hendrik Baier, Michael Kaisers

Finding a best response policy is a central objective in game theory and multi-agent learning, with modern population-based training approaches employing reinforcement learning algorithms as best-response oracles to improve play against candidate opponents (typically previously learnt policies).

Online Planning in POMDPs with Self-Improving Simulators

1 code implementation27 Jan 2022 Jinke He, Miguel Suau, Hendrik Baier, Michael Kaisers, Frans A. Oliehoek

To plan reliably and efficiently while the approximate simulator is learning, we develop a method that adaptively decides which simulator to use for every simulation, based on a statistic that measures the accuracy of the approximate simulator.

Automated Peer-to-peer Negotiation for Energy Contract Settlements in Residential Cooperatives

no code implementations26 Nov 2019 Shantanu Chakraborty, Tim Baarslag, Michael Kaisers

This paper presents an automated peer-to-peer negotiation strategy for settling energy contracts among prosumers in a Residential Energy Cooperative considering heterogeneity prosumer preferences.

Fairness Navigate

Robust Temporal Difference Learning for Critical Domains

no code implementations23 Jan 2019 Richard Klima, Daan Bloembergen, Michael Kaisers, Karl Tuyls

We prove convergence of the operator to the optimal robust Q-function with respect to the model using the theory of Generalized Markov Decision Processes.

Energy Contract Settlements through Automated Negotiation in Residential Cooperatives

no code implementations28 Jul 2018 Shantanu Chakraborty, Tim Baarslag, Michael Kaisers

This paper presents an automated peer-to-peer (P2P) negotiation strategy for settling energy contracts among prosumers in a Residential Energy Cooperative (REC) considering heterogeneous prosumer preferences.

Multiagent Systems

A Survey of Learning in Multiagent Environments: Dealing with Non-Stationarity

no code implementations28 Jul 2017 Pablo Hernandez-Leal, Michael Kaisers, Tim Baarslag, Enrique Munoz de Cote

The key challenge in multiagent learning is learning a best response to the behaviour of other agents, which may be non-stationary: if the other agents adapt their strategy as well, the learning target moves.

Multi-Armed Bandits

Cannot find the paper you are looking for? You can Submit a new open access paper.