1 code implementation • 7 Oct 2019 • Ian A. Kash, Michael Sullins, Katja Hofmann
Counterfactual Regret Minimization (CFR) has found success in settings like poker which have both terminal states and perfect recall.
counterfactual Q-Learning