# Dimensionality Reduction of Massive Sparse Datasets Using Coresets

Dan Feldman, Mikhail Volkov, Daniela Rus

In this paper we present a practical solution with performance guarantees to the problem of dimensionality reduction for very large scale sparse matrices. We show applications of our approach to computing the Principal Component Analysis (PCA) of any $n\times d$ matrix, using one pass over the stream of its rows...
