1 code implementation • 4 Jul 2012 • Finnegan Southey, Michael P. Bowling, Bryce Larson, Carmelo Piccione, Neil Burch, Darse Billings, Chris Rayner
We demonstrate methods for playing effective responses to the opponent, based on the posterior.
no code implementations • 13 Jun 2012 • Richard S. Sutton, Csaba Szepesvari, Alborz Geramifard, Michael P. Bowling
Our main results are to prove that linear Dyna-style planning converges to a unique solution independent of the generating distribution, under natural conditions.