no code implementations • 29 Sep 2021 • Alberto Maria Metelli, Samuele Meta, Marcello Restelli
In this setting, Importance Sampling (IS) is typically employed as a what-if analysis tool, with the goal of estimating the performance of a target policy, given samples collected with a different behavioral policy.