1 code implementation • 13 Sep 2021 • Simon P. Shen, Yecheng Jason Ma, Omer Gottesman, Finale Doshi-Velez
Importance sampling-based estimators for off-policy evaluation (OPE) are valued for their simplicity, unbiasedness, and reliance on relatively few assumptions.
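For concreteness, the following is a minimal sketch of the ordinary (trajectory-wise) importance sampling estimator, the textbook form of the estimators referred to above. It is not necessarily the estimator the authors study. The function name `importance_sampling_estimate`, the callables `target_prob` and `behavior_prob`, and the toy two-action data are all hypothetical and chosen only for illustration. The sketch assumes the behavior policy's action probabilities are known and nonzero for every logged action.

```python
def importance_sampling_estimate(trajectories, target_prob, behavior_prob, gamma=1.0):
    """Ordinary importance sampling estimate of the target policy's value.

    trajectories: list of trajectories, each a list of (state, action, reward) tuples
                  collected under the behavior policy.
    target_prob(s, a), behavior_prob(s, a): probability of action a in state s
                  under the target and behavior policies (hypothetical callables).
    gamma: discount factor.
    """
    estimates = []
    for traj in trajectories:
        weight = 1.0  # product of per-step likelihood ratios
        ret = 0.0     # discounted return of this trajectory
        for t, (s, a, r) in enumerate(traj):
            weight *= target_prob(s, a) / behavior_prob(s, a)
            ret += (gamma ** t) * r
        estimates.append(weight * ret)
    # Average the reweighted returns over all logged trajectories.
    return sum(estimates) / len(estimates)


# Toy one-step example (made-up data): the behavior policy picks each of two
# actions with probability 0.5; the target policy always picks action 1.
logged = [[(0, 0, 0.0)], [(0, 1, 1.0)]]
value = importance_sampling_estimate(
    logged,
    target_prob=lambda s, a: 1.0 if a == 1 else 0.0,
    behavior_prob=lambda s, a: 0.5,
)
```

The sketch needs only logged trajectories and the two policies' action probabilities; it requires no model of the environment, which illustrates the "relatively few assumptions" mentioned above. In the toy example the reweighted returns are 0 and 2, so the estimate equals 1, the target policy's true value in this toy setup.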