Search Results for author: Michael Sullins

Found 1 paper, 1 paper with code

Combining No-regret and Q-learning

1 code implementation · 7 Oct 2019 · Ian A. Kash, Michael Sullins, Katja Hofmann

Counterfactual Regret Minimization (CFR) has found success in settings such as poker, which have both terminal states and perfect recall.

Tags: counterfactual, Q-Learning
