no code implementations • 10 Jan 2024 • Soham Jana, Jianqing Fan, Sanjeev Kulkarni
In this paper, we introduce a hybrid clustering technique with a novel multivariate trimmed mean type centroid estimate to produce mislabeling guarantees under a weak initialization condition for general error distributions around the centroids.
no code implementations • 16 Jun 2023 • Soham Jana, Kun Yang, Sanjeev Kulkarni
In the absence of outliers, in fixed dimensions, our theoretical guarantees are similar to that of the Lloyd algorithm.
1 code implementation • 29 Oct 2021 • Soham Jana, Henry Li, Yutaro Yamada, Ofir Lindenbaum
Consider the problem of simultaneous estimation and support recovery of the coefficient vector in a linear data model with additive Gaussian noise.
no code implementations • 21 May 2020 • Soham Jana, Yury Polyanskiy, Yihong Wu
Nevertheless, we show that in the sublinear regime of $m =\omega(k/\log k)$, it is possible to consistently estimate in total variation the \emph{profile} of the population, defined as the empirical distribution of the sizes of each type, which determines many symmetric properties of the population.