Search Results for author: Alexandre Marthe

Found 1 papers, 0 papers with code

Beyond Average Return in Markov Decision Processes

no code implementations NeurIPS 2023 Alexandre Marthe, Aurélien Garivier, Claire Vernade

What are the functionals of the reward that can be computed and optimized exactly in Markov Decision Processes? In the finite-horizon, undiscounted setting, Dynamic Programming (DP) can only handle these operations efficiently for certain classes of statistics.

Distributional Reinforcement Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.