Search Results for author: Lili Wu

Found 4 papers, 2 papers with code

Anytime-valid off-policy inference for contextual bandits

1 code implementation • 19 Oct 2022 • Ian Waudby-Smith, Lili Wu, Aaditya Ramdas, Nikos Karampatziakis, Paul Mineiro

Importantly, our methods can be employed while the original experiment is still running (that is, not necessarily post hoc), when the logging policy may itself be changing (due to learning), and even if the context distributions form a highly dependent time series (for example, if they drift over time).

Multi-Armed Bandits • Off-policy evaluation • +1

Parameterized Exploration

no code implementations • 13 Jul 2019 • Jesse Clifton, Lili Wu, Eric Laber

We introduce Parameterized Exploration (PE), a simple family of methods for model-based tuning of the exploration schedule in sequential decision problems.

Multi-Armed Bandits

Multivariate Density Estimation with Missing Data

1 code implementation • 14 Aug 2018 • Zhen Li, Lili Wu, Weilian Zhou, Sujit Ghosh

Multivariate density estimation is a popular technique in statistics with wide applications including regression models allowing for heteroskedasticity in conditional variances.

Methodology
