Search Results for author: Mirco Mutti

Found 3 papers, 1 papers with code

Task-Agnostic Exploration via Policy Gradient of a Non-Parametric State Entropy Estimate

1 code implementation9 Jul 2020 Mirco Mutti, Lorenzo Pratissoli, Marcello Restelli

In a reward-free environment, what is a suitable intrinsic objective for an agent to pursue so that it can learn an optimal task-agnostic exploration policy?

Continuous Control

An Intrinsically-Motivated Approach for Learning Highly Exploring and Fast Mixing Policies

no code implementations10 Jul 2019 Mirco Mutti, Marcello Restelli

What is a good exploration strategy for an agent that interacts with an environment in the absence of external rewards?

Model-based Reinforcement Learning

Configurable Markov Decision Processes

no code implementations ICML 2018 Alberto Maria Metelli, Mirco Mutti, Marcello Restelli

After having introduced our approach and derived some theoretical results, we present the experimental evaluation in two explicative problems to show the benefits of the environment configurability on the performance of the learned policy.

Cannot find the paper you are looking for? You can Submit a new open access paper.