no code implementations • NeurIPS 2021 • Luong Ha, Nguyen, James-A. Goulet
In this paper, we present how we can adapt the temporal difference Q-learning framework to make it compatible with the tractable approximate Gaussian inference (TAGI), which allows learning the parameters of a neural network using a closed-form analytical method.