no code implementations • 11 Jun 2021 • Sanath Kumar Krishnamurthy, Adrienne Margaret Propp, Susan Athey
Our algorithm is based on a novel misspecification test, and our analysis demonstrates the benefits of using model selection for reward estimation.