Search Results for author: Stavros Gerakaris

Found 1 papers, 0 papers with code

Learning Best Response Strategies for Agents in Ad Exchanges

no code implementations10 Feb 2019 Stavros Gerakaris, Subramanian Ramamoorthy

We address this problem using the Harsanyi-Bellman Ad Hoc Coordination (HBA) algorithm, which conceptualises this interaction in terms of a Stochastic Bayesian Game and arrives at optimal actions by best responding with respect to probabilistic beliefs maintained over a candidate set of opponent behaviour profiles.

Q-Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.