Search Results for author: Francesco Vidaich

Found 1 paper, 0 papers with code

Lifelong Hyper-Policy Optimization with Multiple Importance Sampling Regularization

no code implementations • 13 Dec 2021 • Pierre Liotet, Francesco Vidaich, Alberto Maria Metelli, Marcello Restelli

This hyper-policy is trained to maximize the estimated future performance, efficiently reusing past data by means of importance sampling, at the cost of introducing a controlled bias.

Management
