Search Results for author: Koulik Khamaru

Found 16 papers, 2 papers with code

Stochastic Optimization with Constraints: A Non-asymptotic Instance-Dependent Analysis

no code implementations • 24 Mar 2024 • Koulik Khamaru

Our main result is a non-asymptotic guarantee for VRPG algorithm.

Paper
Add Code

Statistical Limits of Adaptive Linear Models: Low-Dimensional Estimation and Inference

1 code implementation • NeurIPS 2023 • Licong Lin, Mufang Ying, Suvrojit Ghosh, Koulik Khamaru, Cun-Hui Zhang

Even in linear models, the Ordinary Least Squares (OLS) estimator may fail to exhibit asymptotic normality for single coordinate estimation and have inflated error.

Paper
Code

Adaptive Linear Estimating Equations

1 code implementation • NeurIPS 2023 • Mufang Ying, Koulik Khamaru, Cun-Hui Zhang

Sequential data collection has emerged as a widely adopted technique for enhancing the efficiency of data gathering processes.

Multi-Armed Bandits

Paper
Code

Semi-parametric inference based on adaptively collected data

no code implementations • 5 Mar 2023 • Licong Lin, Koulik Khamaru, Martin J. Wainwright

Many standard estimators, when applied to adaptively collected data, fail to be asymptotically normal, thereby complicating the construction of confidence intervals.

Paper
Add Code

Instance-Dependent Confidence and Early Stopping for Reinforcement Learning

no code implementations • 21 Jan 2022 • Koulik Khamaru, Eric Xia, Martin J. Wainwright, Michael I. Jordan

As a consequence, we propose a data-dependent stopping rule for instance-optimal algorithms.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Optimal variance-reduced stochastic approximation in Banach spaces

no code implementations • 21 Jan 2022 • Wenlong Mou, Koulik Khamaru, Martin J. Wainwright, Peter L. Bartlett, Michael I. Jordan

We study the problem of estimating the fixed point of a contractive operator defined on a separable Banach space.

Q-Learning

Paper
Add Code

Near-optimal inference in adaptive linear regression

no code implementations • 5 Jul 2021 • Koulik Khamaru, Yash Deshpande, Tor Lattimore, Lester Mackey, Martin J. Wainwright

We propose a family of online debiasing estimators to correct these distributional anomalies in least squares estimation.

Active Learning regression +2

Paper
Add Code

Instance-optimality in optimal value estimation: Adaptivity via variance-reduced Q-learning

no code implementations • 28 Jun 2021 • Koulik Khamaru, Eric Xia, Martin J. Wainwright, Michael I. Jordan

Various algorithms in reinforcement learning exhibit dramatic variability in their convergence rates and ultimate accuracy as a function of the problem structure.

Q-Learning

Paper
Add Code

Instability, Computational Efficiency and Statistical Accuracy

no code implementations • 22 May 2020 • Nhat Ho, Koulik Khamaru, Raaz Dwivedi, Martin J. Wainwright, Michael. I. Jordan, Bin Yu

Many statistical estimators are defined as the fixed point of a data-dependent operator, with estimators based on minimizing a cost function being an important special case.

Computational Efficiency

Paper
Add Code

Is Temporal Difference Learning Optimal? An Instance-Dependent Analysis

no code implementations • 16 Mar 2020 • Koulik Khamaru, Ashwin Pananjady, Feng Ruan, Martin J. Wainwright, Michael. I. Jordan

We address the problem of policy evaluation in discounted Markov decision processes, and provide instance-dependent guarantees on the $\ell_\infty$-error under a generative model.

Paper
Add Code

Sharp Analysis of Expectation-Maximization for Weakly Identifiable Models

no code implementations • 1 Feb 2019 • Raaz Dwivedi, Nhat Ho, Koulik Khamaru, Martin J. Wainwright, Michael. I. Jordan, Bin Yu

We study a class of weakly identifiable location-scale mixture models for which the maximum likelihood estimates based on $n$ i. i. d.

Paper
Add Code

Derivative-Free Methods for Policy Optimization: Guarantees for Linear Quadratic Systems

no code implementations • 20 Dec 2018 • Dhruv Malik, Ashwin Pananjady, Kush Bhatia, Koulik Khamaru, Peter L. Bartlett, Martin J. Wainwright

We focus on characterizing the convergence rate of these methods when applied to linear-quadratic systems, and study various settings of driving noise and reward feedback.

Paper
Add Code

Theoretical guarantees for EM under misspecified Gaussian mixture models

no code implementations • NeurIPS 2018 • Raaz Dwivedi, Nhật Hồ, Koulik Khamaru, Martin J. Wainwright, Michael. I. Jordan

We provide two classes of theoretical guarantees: first, we characterize the bias introduced due to the misspecification; and second, we prove that population EM converges at a geometric rate to the model projection under a suitable initialization condition.

Paper
Add Code

Singularity, Misspecification, and the Convergence Rate of EM

no code implementations • 1 Oct 2018 • Raaz Dwivedi, Nhat Ho, Koulik Khamaru, Michael. I. Jordan, Martin J. Wainwright, Bin Yu

A line of recent work has analyzed the behavior of the Expectation-Maximization (EM) algorithm in the well-specified setting, in which the population likelihood is locally strongly concave around its maximizing argument.

Paper
Add Code

Convergence guarantees for a class of non-convex and non-smooth optimization problems

no code implementations • ICML 2018 • Koulik Khamaru, Martin J. Wainwright

We also show that our algorithms can escape strict saddle points for a class of non-smooth functions, thereby generalizing known results for smooth functions.

Density Estimation

Paper
Add Code

Computation of the Maximum Likelihood estimator in low-rank Factor Analysis

no code implementations • 18 Jan 2018 • Koulik Khamaru, Rahul Mazumder

Factor analysis, a classical multivariate statistical technique is popularly used as a fundamental tool for dimensionality reduction in statistics, econometrics and data science.

Dimensionality Reduction Econometrics

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.