no code implementations • 20 Mar 2018 • Ethan Knight, Osher Lerner
We present a novel algorithm to train a deep Q-learning agent using natural-gradient techniques.
Hyperparameter Optimization Q-Learning +2