1 code implementation • 4 Nov 2020 • Anand Kamat, Doina Precup
We show empirically that our proposed method is capable of learning options end-to-end on several discrete and continuous control tasks, outperforms option-critic by a wide margin.
Continuous Control