1 code implementation • 9 Nov 2020 • Kaleb Ben Naveed, Zhiqian Qiao, John M. Dolan
Incomplete observations are handled with a Long Short-Term Memory (LSTM) layer in the network.
Autonomous Driving • Hierarchical Reinforcement Learning +5
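
To illustrate why a recurrent layer helps with incomplete observations, here is a minimal sketch of an LSTM cell in plain Python. It is not the authors' network: the names `TinyLSTMCell` and `encode_history` are hypothetical, the weights are random and untrained, and the sizes are chosen only for illustration. The point it shows is that the hidden state summarizes earlier observations, so two histories ending in the same partial observation can still be told apart.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


class TinyLSTMCell:
    """Hypothetical single LSTM cell using plain lists (illustration only)."""

    def __init__(self, input_size, hidden_size, seed=0):
        rng = random.Random(seed)
        self.input_size = input_size
        self.hidden_size = hidden_size
        n = input_size + hidden_size
        # One weight matrix and bias per gate:
        # i = input gate, f = forget gate, g = cell candidate, o = output gate.
        self.W = {
            gate: [[rng.uniform(-0.5, 0.5) for _ in range(n)]
                   for _ in range(hidden_size)]
            for gate in "ifgo"
        }
        self.b = {gate: [0.0] * hidden_size for gate in "ifgo"}

    def _affine(self, gate, z):
        # Computes W[gate] @ z + b[gate] for the concatenated vector z.
        return [sum(w * v for w, v in zip(row, z)) + bias
                for row, bias in zip(self.W[gate], self.b[gate])]

    def step(self, x, state):
        """Consume one observation x and return the new (h, c) state."""
        h, c = state
        z = list(x) + list(h)  # current observation plus previous hidden state
        i = [sigmoid(a) for a in self._affine("i", z)]
        f = [sigmoid(a) for a in self._affine("f", z)]
        g = [math.tanh(a) for a in self._affine("g", z)]
        o = [sigmoid(a) for a in self._affine("o", z)]
        # The cell state keeps part of the old memory and adds new content.
        c_new = [fj * cj + ij * gj for fj, cj, ij, gj in zip(f, c, i, g)]
        h_new = [oj * math.tanh(cj) for oj, cj in zip(o, c_new)]
        return h_new, c_new


def encode_history(cell, observations):
    """Run the cell over an observation sequence; return the final hidden vector."""
    state = ([0.0] * cell.hidden_size, [0.0] * cell.hidden_size)
    for obs in observations:
        state = cell.step(obs, state)
    return state[0]


cell = TinyLSTMCell(input_size=2, hidden_size=4)
# Both histories end with the same uninformative observation [0, 0],
# standing in for a time step where part of the scene is not observed.
history_a = [[1.0, 0.0], [0.0, 0.0]]
history_b = [[0.0, 1.0], [0.0, 0.0]]
h_a = encode_history(cell, history_a)
h_b = encode_history(cell, history_b)
print(h_a != h_b)
```

A feedforward network that sees only the last observation would receive identical inputs for both histories. In a reinforcement learning agent, a vector like `h_a` would typically be fed to the policy or value head in place of the raw observation.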