Search Results for author: Ilmun Kim

Found 11 papers, 5 papers with code

Semi-Supervised U-statistics

no code implementations • 29 Feb 2024 • Ilmun Kim, Larry Wasserman, Sivaraman Balakrishnan, Matey Neykov

Semi-supervised datasets are ubiquitous across diverse domains where obtaining fully labeled data is costly or time-consuming.

Paper
Add Code

Differentially Private Permutation Tests: Applications to Kernel Methods

2 code implementations • 29 Oct 2023 • Ilmun Kim, Antonin Schrab

The proposed framework extends classical non-private permutation tests to private settings, maintaining both finite-sample validity and differential privacy in a rigorous manner.

Paper
Code

A Permutation-Free Kernel Independence Test

no code implementations • 18 Dec 2022 • Shubhanshu Shekhar, Ilmun Kim, Aaditya Ramdas

In nonparametric independence testing, we observe i. i. d.\ data $\{(X_i, Y_i)\}_{i=1}^n$, where $X \in \mathcal{X}, Y \in \mathcal{Y}$ lie in any general spaces, and we wish to test the null that $X$ is independent of $Y$.

Paper
Add Code

A Permutation-free Kernel Two-Sample Test

no code implementations • 27 Nov 2022 • Shubhanshu Shekhar, Ilmun Kim, Aaditya Ramdas

The usual kernel-MMD test statistic is a degenerate U-statistic under the null, and thus it has an intractable limiting distribution.

Two-sample testing Vocal Bursts Valence Prediction

Paper
Add Code

The Projected Covariance Measure for assumption-lean variable significance testing

1 code implementation • 3 Nov 2022 • Anton Rask Lundborg, Ilmun Kim, Rajen D. Shah, Richard J. Samworth

In this work we study the problem of testing the model-free null of conditional mean independence, i. e. that the conditional mean of $Y$ given $X$ and $Z$ does not depend on $X$.

Additive models regression

Paper
Code

Efficient Aggregated Kernel Tests using Incomplete $U$-statistics

4 code implementations • 18 Jun 2022 • Antonin Schrab, Ilmun Kim, Benjamin Guedj, Arthur Gretton

We derive non-asymptotic uniform separation rates for MMDAggInc and HSICAggInc, and quantify exactly the trade-off between computational efficiency and the attainable rates: this result is novel for tests based on incomplete $U$-statistics, to our knowledge.

Computational Efficiency

Paper
Code

MMD Aggregated Two-Sample Test

3 code implementations • NeurIPS 2023 • Antonin Schrab, Ilmun Kim, Mélisande Albert, Béatrice Laurent, Benjamin Guedj, Arthur Gretton

In practice, this parameter is unknown and, hence, the optimal MMD test with this particular kernel cannot be used.

Translation Two-sample testing +1

Paper
Code

Randomized tests for high-dimensional regression: A more efficient and powerful solution

no code implementations • NeurIPS 2020 • Yue Li, Ilmun Kim, Yuting Wei

We investigate the problem of testing the global null in the high-dimensional regression models when the feature dimension $p$ grows proportionally to the number of observations $n$.

regression

Paper
Add Code

Dimension-agnostic inference using cross U-statistics

no code implementations • 10 Nov 2020 • Ilmun Kim, Aaditya Ramdas

Classical asymptotic theory for statistical inference usually involves calibrating a statistic by fixing the dimension $d$ while letting the sample size $n$ increase to infinity.

Two-sample testing

Paper
Add Code

Validation of Approximate Likelihood and Emulator Models for Computationally Intensive Simulations

1 code implementation • 27 May 2019 • Niccolò Dalmasso, Ann B. Lee, Rafael Izbicki, Taylor Pospisil, Ilmun Kim, Chieh-An Lin

At the heart of our approach is a two-sample test that quantifies the quality of the fit at fixed parameter values, and a global test that assesses goodness-of-fit across simulation parameters.

Paper
Code

Classification accuracy as a proxy for two sample testing

no code implementations • 6 Feb 2016 • Ilmun Kim, Aaditya Ramdas, Aarti Singh, Larry Wasserman

We prove two results that hold for all classifiers in any dimensions: if its true error remains $\epsilon$-better than chance for some $\epsilon>0$ as $d, n \to \infty$, then (a) the permutation-based test is consistent (has power approaching to one), (b) a computationally efficient test based on a Gaussian approximation of the null distribution is also consistent.

Classification General Classification +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.