Search Results for author: Eric Xia

Found 4 papers, 0 papers with code

Krylov-Bellman boosting: Super-linear policy evaluation in general state spaces

no code implementations • 20 Oct 2022 • Eric Xia, Martin J. Wainwright

Second, by combining this meta-result with sample-size dependent guarantees for residual fitting and LSTD computation, we obtain concrete statistical guarantees that depend on the sample size along with the complexity of the function class used to fit the residuals.

Instance-optimality in optimal value estimation: Adaptivity via variance-reduced Q-learning

no code implementations • 28 Jun 2021 • Koulik Khamaru, Eric Xia, Martin J. Wainwright, Michael I. Jordan

Various algorithms in reinforcement learning exhibit dramatic variability in their convergence rates and ultimate accuracy as a function of the problem structure.

Q-Learning

Posterior Distribution for the Number of Clusters in Dirichlet Process Mixture Models

no code implementations • 23 May 2019 • Chiao-Yu Yang, Eric Xia, Nhat Ho, Michael I. Jordan

In this work, we provide a rigorous study for the posterior distribution of the number of clusters in DPMM under different prior distributions on the parameters and constraints on the distributions of the data.

Clustering
