no code implementations • 4 Dec 2022 • Yusuke Narita, Kyohei Okumura, Akihiro Shimizu, Kohei Yata
Off-policy evaluation (OPE) attempts to predict the performance of counterfactual policies using log data from a different policy.
counterfactual • Decision Making
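
To make the OPE setup above concrete, here is a minimal sketch of the textbook inverse propensity scoring (IPS) estimator. It is shown only as an illustration of the general idea and is not claimed to be the method of this paper. The function name `ips_estimate` and its parameters are hypothetical. The sketch assumes each logged record stores the observed reward, the probability the logging policy gave to the action it took, and the probability the counterfactual (target) policy would give to that same action.

```python
def ips_estimate(rewards, logging_probs, target_probs):
    """Estimate the mean reward of a target policy from logged data.

    rewards[i]       -- reward observed for the logged action in round i
    logging_probs[i] -- probability the logging policy chose that action
    target_probs[i]  -- probability the target policy would choose it
    All three names are illustrative assumptions, not from the source.
    """
    n = len(rewards)
    if n == 0 or not (n == len(logging_probs) == len(target_probs)):
        raise ValueError("inputs must be non-empty and of equal length")

    total = 0.0
    for reward, p_log, p_tgt in zip(rewards, logging_probs, target_probs):
        if p_log <= 0.0:
            # IPS needs the logging policy to give positive probability
            # to every action that was actually logged.
            raise ValueError("logging probabilities must be positive")
        # Reweight each logged reward by how much more (or less) likely
        # the target policy is to take the logged action.
        total += (p_tgt / p_log) * reward
    return total / n


# Usage: four logged rounds from a uniform two-action logging policy,
# evaluated for a target policy that always picks the rewarded action.
estimate = ips_estimate(
    rewards=[1.0, 0.0, 1.0, 0.0],
    logging_probs=[0.5, 0.5, 0.5, 0.5],
    target_probs=[1.0, 0.0, 1.0, 0.0],
)
```

When the target policy equals the logging policy, every weight is 1 and the estimate reduces to the plain average of the logged rewards.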