no code implementations
21 Mar 2019
Deep Q-Learning (DQL), a family of temporal difference algorithms for control, employs three techniques collectively known as the "deadly triad" in reinforcement learning: bootstrapping, off-policy learning, and function approximation.
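As a rough illustration of where the three ingredients show up, the following is a minimal sketch of one semi-gradient Q-learning step with a linear approximator. It is not taken from the paper: the function names, the feature vectors, and the step size and discount values are all assumptions chosen for illustration, and a deep Q-learning agent would replace the linear model with a neural network.

```python
from typing import List, Sequence, Tuple


def q_value(weights: Sequence[float], features: Sequence[float]) -> float:
    """Function approximation: Q(s, a) is a dot product of weights and features."""
    return sum(w * x for w, x in zip(weights, features))


def q_learning_update(
    weights: Sequence[float],
    phi_sa: Sequence[float],             # features of the (state, action) pair taken
    reward: float,
    next_phis: Sequence[Sequence[float]],  # features of each action in the next state
    alpha: float = 0.1,                  # step size (assumed value)
    gamma: float = 0.99,                 # discount factor (assumed value)
    terminal: bool = False,
) -> Tuple[List[float], float]:
    """One semi-gradient Q-learning step; returns (new_weights, td_error)."""
    # Off-policy: the target takes the max over next actions, regardless of
    # which action the behaviour policy will actually choose.
    # Bootstrapping: that max is computed from the current estimates.
    if terminal:
        next_best = 0.0
    else:
        next_best = max(q_value(weights, phi) for phi in next_phis)
    td_error = reward + gamma * next_best - q_value(weights, phi_sa)
    # Semi-gradient step: the target is treated as a constant.
    new_weights = [w + alpha * td_error * x for w, x in zip(weights, phi_sa)]
    return new_weights, td_error


if __name__ == "__main__":
    w = [0.0, 0.0]
    w, delta = q_learning_update(
        w, phi_sa=[1.0, 0.0], reward=1.0,
        next_phis=[[0.0, 1.0], [1.0, 0.0]], alpha=0.5, gamma=0.9,
    )
    print(w, delta)
```

Each of the three components appears on its own line here, which makes it easy to see that the update target depends on the same weights being updated. That coupling is the usual explanation for why the combination can be unstable.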
1 code implementation
20 Mar 2018
We present a novel algorithm to train a deep Q-learning agent using natural-gradient techniques.