no code implementations • 5 Jun 2023 • Yecheng Xue, Xiaoyu Chen, Tongyang Li, Shaofeng H. -C. Jiang
$k$-Clustering in $\mathbb{R}^d$ (e. g., $k$-median and $k$-means) is a fundamental machine learning problem.
no code implementations • 1 Oct 2022 • Kuan Cheng, Shaofeng H. -C. Jiang, Luojian Wei, Zhide Wei
The method of random Fourier features (RFF), proposed in a seminal paper by Rahimi and Recht (NIPS'07), is a powerful technique to find approximate low-dimensional representations of points in (high-dimensional) kernel space, for shift-invariant kernels.
1 code implementation • NeurIPS 2019 • Lingxiao Huang, Shaofeng H. -C. Jiang, Nisheeth K. Vishnoi
Our approach is based on novel constructions of coresets: for the $k$-median objective, we construct an $\varepsilon$-coreset of size $O(\Gamma k^2 \varepsilon^{-d})$ where $\Gamma$ is the number of distinct collections of groups that a point may belong to, and for the $k$-means objective, we show how to construct an $\varepsilon$-coreset of size $O(\Gamma k^3\varepsilon^{-d-1})$.
1 code implementation • 11 Mar 2019 • Vladimir Braverman, Shaofeng H. -C. Jiang, Robert Krauthgamer, Xuan Wu
We design coresets for Ordered k-Median, a generalization of classical clustering problems such as k-Median and k-Center, that offers a more flexible data analysis, like easily combining multiple objectives (e. g., to increase fairness or for Pareto optimization).
Data Structures and Algorithms