no code implementations • 28 Mar 2024 • Johannes Müller, Semih Çaycı, Guido Montúfar
Kakade's natural policy gradient method has been studied extensively in the last years showing linear convergence with and without regularization.