Search Results for author: Christos Boutsidis

Found 12 papers, 0 papers with code

Optimal Sparse Linear Encoders and Sparse PCA

no code implementations • NeurIPS 2016 • Malik Magdon-Ismail, Christos Boutsidis

Principal components analysis~(PCA) is the optimal linear encoder of data.

Paper
Add Code

Optimal Sparse Linear Auto-Encoders and Sparse PCA

no code implementations • 23 Feb 2015 • Malik Magdon-Ismail, Christos Boutsidis

Principal components analysis (PCA) is the optimal linear auto-encoder of data, and it is often used to construct features.

Natural Questions

Paper
Add Code

Optimal CUR Matrix Decompositions

no code implementations • 30 May 2014 • Christos Boutsidis, David P. Woodruff

The CUR decomposition of an $m \times n$ matrix $A$ finds an $m \times c$ matrix $C$ with a subset of $c < n$ columns of $A,$ together with an $r \times n$ matrix $R$ with a subset of $r < m$ rows of $A,$ as well as a $c \times r$ low-rank matrix $U$ such that the matrix $C U R$ approximates the matrix $A,$ that is, $ || A - CUR ||_F^2 \le (1+\epsilon) || A - A_k||_F^2$, where $||.||_F$ denotes the Frobenius norm and $A_k$ is the best $m \times n$ matrix of rank $k$ constructed via the SVD.

Paper
Add Code

Provable Deterministic Leverage Score Sampling

no code implementations • 6 Apr 2014 • Dimitris Papailiopoulos, Anastasios Kyrillidis, Christos Boutsidis

We explain theoretically a curious empirical phenomenon: "Approximating a matrix by deterministically selecting a subset of its columns with the corresponding largest leverage scores results in a good low-rank matrix surrogate".

Paper
Add Code

Spectral Clustering via the Power Method -- Provably

no code implementations • 12 Nov 2013 • Christos Boutsidis, Alex Gittens, Prabhanjan Kambadur

Spectral clustering is one of the most important algorithms in data mining and machine intelligence; however, its computational complexity limits its application to truly large scale data analysis.

Clustering

Paper
Add Code

Random Projections for Linear Support Vector Machines

no code implementations • 26 Nov 2012 • Saurabh Paul, Christos Boutsidis, Malik Magdon-Ismail, Petros Drineas

Let X be a data matrix of rank \rho, whose rows represent n points in d-dimensional space.

Dimensionality Reduction General Classification +1

Paper
Add Code

Near-optimal Coresets For Least-Squares Regression

no code implementations • 16 Feb 2012 • Christos Boutsidis, Petros Drineas, Malik Magdon-Ismail

We study (constrained) least-squares regression as well as multiple response least-squares regression and ask the question of whether a subset of the data, a coreset, suffices to compute a good approximate solution to the regression.

regression

Paper
Add Code

Sparse Features for PCA-Like Linear Regression

no code implementations • NeurIPS 2011 • Christos Boutsidis, Petros Drineas, Malik Magdon-Ismail

Principal Components Analysis~(PCA) is often used as a feature extraction procedure.

regression

Paper
Add Code

Randomized Dimensionality Reduction for k-means Clustering

no code implementations • 13 Oct 2011 • Christos Boutsidis, Anastasios Zouzias, Michael W. Mahoney, Petros Drineas

On the other hand, two provably accurate feature extraction methods for $k$-means clustering are known in the literature; one is based on random projections and the other is based on the singular value decomposition (SVD).

Clustering Dimensionality Reduction +1