Search Results for author: Joan Bas-Serrano

Found 2 papers, 0 papers with code

Logistic Q-Learning

no code implementations 21 Oct 2020 Joan Bas-Serrano, Sebastian Curi, Andreas Krause, Gergely Neu

We propose a new reinforcement learning algorithm derived from a regularized linear-programming formulation of optimal control in MDPs.

Q-Learning, Reinforcement Learning (RL)

Faster saddle-point optimization for solving large-scale Markov decision processes

no code implementations L4DC 2020 Joan Bas-Serrano, Gergely Neu

We consider the problem of computing optimal policies in average-reward Markov decision processes.
