Publication
NeurIPS 2011
Conference paper

Sparse features for PCA-like linear regression

Abstract

Principal Components Analysis (PCA) is often used as a feature extraction procedure. Given a matrix X ∈ ℝ^{n×d} whose rows represent n data points with respect to d features, the top κ right singular vectors of X (the so-called eigenfeatures) are arbitrary linear combinations of all available features. The eigenfeatures are very useful in data analysis, including the regularization of linear regression. Enforcing sparsity on the eigenfeatures, i.e., forcing them to be linear combinations of only a small number of actual features (as opposed to all available features), can lead to better generalization and improve the interpretability of the eigenfeatures. We present deterministic and randomized algorithms that construct such sparse eigenfeatures while provably achieving in-sample performance comparable to that of regularized linear regression. Our algorithms are relatively simple and practically efficient, and we demonstrate their performance on several data sets.
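As a point of reference for the dense baseline the abstract contrasts against, the following is a minimal sketch of PCA-based regression (principal-component regression): project the data onto the top-κ right singular vectors of X and regress on the projections. It assumes NumPy; the function name pcr_fit and the toy data are illustrative, not taken from the paper.

```python
import numpy as np

def pcr_fit(X, y, k):
    """Principal-component regression: regress y on the top-k
    eigenfeatures (right singular vectors) of X.
    Illustrative sketch, not the paper's sparse algorithm."""
    # Top-k right singular vectors of X (the dense eigenfeatures).
    _, _, Vt = np.linalg.svd(X, full_matrices=False)
    Vk = Vt[:k].T                       # d x k
    Z = X @ Vk                          # project data onto eigenfeatures
    w, *_ = np.linalg.lstsq(Z, y, rcond=None)
    return Vk @ w                       # coefficients back in feature space

# Toy usage with synthetic data.
rng = np.random.default_rng(0)
X = rng.standard_normal((100, 20))
y = X @ rng.standard_normal(20) + 0.1 * rng.standard_normal(100)
beta = pcr_fit(X, y, k=5)
```

Each column of Vk above mixes all d features; the paper's contribution is to construct analogues of these directions supported on only a few actual features while provably matching the in-sample performance of regularized regression.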

Date

12 Dec 2011
