Search Results for author: Amarildo Likmeta

Wasserstein Actor-Critic: Directed Exploration via Optimism for Continuous-Actions Control

Uncertainty quantification has been extensively used as a means to achieve efficient directed exploration in Reinforcement Learning (RL).

It has been extended to complex continuous domains through function approximators that bias the search of the planning tree, as in AlphaZero.

How does the uncertainty of the value function propagate when performing temporal difference learning?
