Search Results for author: Joan Bas-Serrano

Logistic Q-Learning

We propose a new reinforcement learning algorithm derived from a regularized linear-programming formulation of optimal control in MDPs.

Paper
Add Code

We consider the problem of computing optimal policies in average-reward Markov decision processes.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.