Search Results for author: Simon Weissmann

Found 2 papers, 0 papers with code

Beyond Stationarity: Convergence Analysis of Stochastic Softmax Policy Gradient Methods

no code implementations4 Oct 2023 Sara Klein, Simon Weissmann, Leif Döring

This paper introduces a combination of dynamic programming and policy gradient called dynamic policy gradient, where the parameters are trained backwards in time.

Decision Making Policy Gradient Methods

Cannot find the paper you are looking for? You can Submit a new open access paper.