no code implementations • 4 Mar 2023 • Amarildo Likmeta, Matteo Sacco, Alberto Maria Metelli, Marcello Restelli
Uncertainty quantification has been extensively used as a means to achieve efficient directed exploration in Reinforcement Learning (RL).
no code implementations • ICLR 2022 • Lorenzo Moro, Amarildo Likmeta, Enrico Prati, Marcello Restelli
It has been extended to complex continuous domains through function approximators to bias the search of the planning tree in AlphaZero.
1 code implementation • NeurIPS 2019 • Alberto Maria Metelli, Amarildo Likmeta, Marcello Restelli
How does the uncertainty of the value function propagate when performing temporal difference learning?