Search Results for author: Pedro Santos

Found 2 papers, 1 paper with code

A new convergent variant of Q-learning with linear function approximation

no code implementations • NeurIPS 2020 • Diogo Carvalho, Francisco S. Melo, Pedro Santos

In this work, we identify a novel set of conditions that ensure convergence with probability 1 of Q-learning with linear function approximation, by proposing a two time-scale variation thereof.

Q-Learning, Reinforcement Learning (RL)
