Search Results for author: Michael Sullins

Combining No-regret and Q-learning

Counterfactual Regret Minimization (CFR) has found success in settings such as poker, which have both terminal states and perfect recall.

