Search Results for author: Diogo Carvalho

Found 1 papers, 0 papers with code

A new convergent variant of Q-learning with linear function approximation

no code implementations • NeurIPS 2020 • Diogo Carvalho, Francisco S. Melo, Pedro Santos

In this work, we identify a novel set of conditions that ensure convergence with probability 1 of Q-learning with linear function approximation, by proposing a two time-scale variation thereof.

Q-Learning Reinforcement Learning (RL)

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.