Search Results for author: Richard J. Samworth

Found 17 papers, 2 papers with code

Optimal convex $M$-estimation via score matching

no code implementations25 Mar 2024 Oliver Y. Feng, Yu-Chun Kao, Min Xu, Richard J. Samworth

As an example of a non-log-concave setting, for Cauchy errors, the optimal convex loss function is Huber-like, and our procedure yields an asymptotic efficiency greater than 0. 87 relative to the oracle maximum likelihood estimator of the regression coefficients that uses knowledge of this error distribution; in this sense, we obtain robustness without sacrificing much efficiency.

regression

Sharp-SSL: Selective high-dimensional axis-aligned random projections for semi-supervised learning

no code implementations18 Apr 2023 Tengyao Wang, Edgar Dobriban, Milana Gataric, Richard J. Samworth

We propose a new method for high-dimensional semi-supervised learning problems based on the careful aggregation of the results of a low-dimensional procedure applied to many axis-aligned random projections of the data.

The Projected Covariance Measure for assumption-lean variable significance testing

1 code implementation3 Nov 2022 Anton Rask Lundborg, Ilmun Kim, Rajen D. Shah, Richard J. Samworth

In this work we study the problem of testing the model-free null of conditional mean independence, i. e. that the conditional mean of $Y$ given $X$ and $Z$ does not depend on $X$.

Additive models regression

Optimal subgroup selection

no code implementations2 Sep 2021 Henry W. J. Reeve, Timothy I. Cannings, Richard J. Samworth

We formulate the problem as one of constrained optimisation, where we seek a low-complexity, data-dependent selection set on which, with a guaranteed probability, the regression function is uniformly at least as large as the threshold; subject to this constraint, we would like the region to contain as much mass under the marginal feature distribution as possible.

regression

Adaptive transfer learning

no code implementations8 Jun 2021 Henry W. J. Reeve, Timothy I. Cannings, Richard J. Samworth

In transfer learning, we wish to make inference about a target population when we have access to data both from the distribution itself, and from a different but related source distribution.

Binary Classification Transfer Learning

USP: an independence test that improves on Pearson's chi-squared and the $G$-test

no code implementations26 Jan 2021 Thomas B. Berrett, Richard J. Samworth

We present the $U$-Statistic Permutation (USP) test of independence in the context of discrete data displayed in a contingency table.

Isotonic regression with unknown permutations: Statistics, computation, and adaptation

no code implementations5 Sep 2020 Ashwin Pananjady, Richard J. Samworth

Motivated by models for multiway comparison data, we consider the problem of estimating a coordinate-wise isotonic function on the domain $[0, 1]^d$ from noisy observations collected on a uniform lattice, but where the design points have been permuted along each dimension.

Computational Efficiency regression

High-dimensional, multiscale online changepoint detection

no code implementations7 Mar 2020 Yudong Chen, Tengyao Wang, Richard J. Samworth

We introduce a new method for high-dimensional, online changepoint detection in settings where a $p$-variate Gaussian data stream may undergo a change in mean.

Vocal Bursts Intensity Prediction

Optimal rates for independence testing via $U$-statistic permutation tests

no code implementations15 Jan 2020 Thomas B. Berrett, Ioannis Kontoyiannis, Richard J. Samworth

We study the problem of independence testing given independent and identically distributed pairs taking values in a $\sigma$-finite, separable measure space.

valid

Goodness-of-fit testing in high-dimensional generalized linear models

2 code implementations9 Aug 2019 Jana Janková, Rajen D. Shah, Peter Bühlmann, Richard J. Samworth

We propose a family of tests to assess the goodness-of-fit of a high-dimensional generalized linear model.

Methodology Statistics Theory Statistics Theory

Efficient functional estimation and the super-oracle phenomenon

no code implementations18 Apr 2019 Thomas B. Berrett, Richard J. Samworth

One interesting consequence of our results is the discovery that, for certain functionals, the worst-case performance of our estimator may improve on that of the natural `oracle' estimator, which is given access to the values of the unknown densities at the observations.

valid

The conditional permutation test for independence while controlling for confounders

no code implementations14 Jul 2018 Thomas B. Berrett, Yi Wang, Rina Foygel Barber, Richard J. Samworth

Like the conditional randomization test of Cand\`es et al. (2018), our test relies on the availability of an approximation to the distribution of $X \mid Z$.

Methodology Statistics Theory Statistics Theory

Classification with imperfect training labels

no code implementations29 May 2018 Timothy I. Cannings, Yingying Fan, Richard J. Samworth

One consequence of these results is that the knn and SVM classifiers are robust to imperfect training labels, in the sense that the rate of convergence of the excess risks of these classifiers remains unchanged; in fact, our theoretical and empirical results even show that in some cases, imperfect labels may improve the performance of these methods.

Classification General Classification

Sparse principal component analysis via axis-aligned random projections

no code implementations15 Dec 2017 Milana Gataric, Tengyao Wang, Richard J. Samworth

We introduce a new method for sparse principal component analysis, based on the aggregation of eigenvector information from carefully-selected axis-aligned random projections of the sample covariance matrix.

Nonparametric independence testing via mutual information

no code implementations17 Nov 2017 Thomas B. Berrett, Richard J. Samworth

We propose a test of independence of two multivariate random vectors, given a sample from the underlying population.

Local nearest neighbour classification with applications to semi-supervised learning

no code implementations3 Apr 2017 Timothy I. Cannings, Thomas B. Berrett, Richard J. Samworth

We derive a new asymptotic expansion for the global excess risk of a local-$k$-nearest neighbour classifier, where the choice of $k$ may depend upon the test point.

Classification General Classification

Statistical and computational trade-offs in estimation of sparse principal components

no code implementations22 Aug 2014 Tengyao Wang, Quentin Berthet, Richard J. Samworth

In this paper, we show that, under a widely-believed assumption from computational complexity theory, there is a fundamental trade-off between statistical and computational performance in this problem.

Computational Efficiency Dimensionality Reduction

Cannot find the paper you are looking for? You can Submit a new open access paper.