Search Results for author: Diogo Carvalho

Found 1 papers, 0 papers with code

A new convergent variant of Q-learning with linear function approximation

no code implementations NeurIPS 2020 Diogo Carvalho, Francisco S. Melo, Pedro Santos

In this work, we identify a novel set of conditions that ensure convergence with probability 1 of Q-learning with linear function approximation, by proposing a two time-scale variation thereof.

Q-Learning reinforcement Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.