Search Results for author: Eric Xia

Found 4 papers, 0 papers with code

Krylov-Bellman boosting: Super-linear policy evaluation in general state spaces

no code implementations • 20 Oct 2022 • Eric Xia, Martin J. Wainwright

Second, by combining this meta-result with sample-size dependent guarantees for residual fitting and LSTD computation, we obtain concrete statistical guarantees that depend on the sample size along with the complexity of the function class used to fit the residuals.

Instance-Dependent Confidence and Early Stopping for Reinforcement Learning

no code implementations • 21 Jan 2022 • Koulik Khamaru, Eric Xia, Martin J. Wainwright, Michael I. Jordan

As a consequence, we propose a data-dependent stopping rule for instance-optimal algorithms.

Reinforcement Learning (RL)

Instance-optimality in optimal value estimation: Adaptivity via variance-reduced Q-learning

no code implementations • 28 Jun 2021 • Koulik Khamaru, Eric Xia, Martin J. Wainwright, Michael I. Jordan

Various algorithms in reinforcement learning exhibit dramatic variability in their convergence rates and ultimate accuracy as a function of the problem structure.

Q-Learning

Posterior Distribution for the Number of Clusters in Dirichlet Process Mixture Models

no code implementations • 23 May 2019 • Chiao-Yu Yang, Eric Xia, Nhat Ho, Michael I. Jordan

In this work, we provide a rigorous study for the posterior distribution of the number of clusters in DPMM under different prior distributions on the parameters and constraints on the distributions of the data.

Clustering
