no code implementations • NeurIPS 2016 • Malik Magdon-Ismail, Christos Boutsidis
Principal components analysis~(PCA) is the optimal linear encoder of data.
no code implementations • 23 Feb 2015 • Malik Magdon-Ismail, Christos Boutsidis
Principal components analysis (PCA) is the optimal linear auto-encoder of data, and it is often used to construct features.
no code implementations • 30 May 2014 • Christos Boutsidis, David P. Woodruff
The CUR decomposition of an $m \times n$ matrix $A$ finds an $m \times c$ matrix $C$ with a subset of $c < n$ columns of $A,$ together with an $r \times n$ matrix $R$ with a subset of $r < m$ rows of $A,$ as well as a $c \times r$ low-rank matrix $U$ such that the matrix $C U R$ approximates the matrix $A,$ that is, $ || A - CUR ||_F^2 \le (1+\epsilon) || A - A_k||_F^2$, where $||.||_F$ denotes the Frobenius norm and $A_k$ is the best $m \times n$ matrix of rank $k$ constructed via the SVD.
no code implementations • 6 Apr 2014 • Dimitris Papailiopoulos, Anastasios Kyrillidis, Christos Boutsidis
We explain theoretically a curious empirical phenomenon: "Approximating a matrix by deterministically selecting a subset of its columns with the corresponding largest leverage scores results in a good low-rank matrix surrogate".
no code implementations • 12 Nov 2013 • Christos Boutsidis, Alex Gittens, Prabhanjan Kambadur
Spectral clustering is one of the most important algorithms in data mining and machine intelligence; however, its computational complexity limits its application to truly large scale data analysis.
no code implementations • 26 Nov 2012 • Saurabh Paul, Christos Boutsidis, Malik Magdon-Ismail, Petros Drineas
Let X be a data matrix of rank \rho, whose rows represent n points in d-dimensional space.
no code implementations • 16 Feb 2012 • Christos Boutsidis, Petros Drineas, Malik Magdon-Ismail
We study (constrained) least-squares regression as well as multiple response least-squares regression and ask the question of whether a subset of the data, a coreset, suffices to compute a good approximate solution to the regression.
no code implementations • NeurIPS 2011 • Christos Boutsidis, Petros Drineas, Malik Magdon-Ismail
Principal Components Analysis~(PCA) is often used as a feature extraction procedure.
no code implementations • 13 Oct 2011 • Christos Boutsidis, Anastasios Zouzias, Michael W. Mahoney, Petros Drineas
On the other hand, two provably accurate feature extraction methods for $k$-means clustering are known in the literature; one is based on random projections and the other is based on the singular value decomposition (SVD).
no code implementations • 26 Sep 2011 • Christos Boutsidis, Malik Magdon-Ismail
We study feature selection for $k$-means clustering.
no code implementations • NeurIPS 2010 • Christos Boutsidis, Anastasios Zouzias, Petros Drineas
This paper discusses the topic of dimensionality reduction for $k$-means clustering.
no code implementations • NeurIPS 2009 • Christos Boutsidis, Petros Drineas, Michael W. Mahoney
We present a novel feature selection algorithm for the $k$-means clustering problem.