TY - JOUR
T1 - Group-wise Principal Component Analysis for Exploratory Data Analysis
AU - Camacho, J.
AU - Rodriquez-Gomez, Rafael A.
AU - Saccenti, E.
PY - 2017
Y1 - 2017
N2 - In this paper, we propose a new framework for matrix factorization based on Principal Component Analysis (PCA) where sparsity is imposed. The structure to impose sparsity is defined in terms of groups of correlated variables found in correlation matrices or maps. The framework is based on three new contributions: an algorithm to identify the groups of variables in correlation maps, a visualization for the resulting groups and a matrix factorization. Together with a method to compute correlation maps with minimum noise level, referred to as Missing-Data for Exploratory Data Analysis (MEDA), these three contributions constitute a complete matrix factorization framework. Two real examples are used to illustrate the approach and compare it with PCA, Sparse PCA and Structured Sparse PCA.
AB - In this paper, we propose a new framework for matrix factorization based on Principal Component Analysis (PCA) where sparsity is imposed. The structure to impose sparsity is defined in terms of groups of correlated variables found in correlation matrices or maps. The framework is based on three new contributions: an algorithm to identify the groups of variables in correlation maps, a visualization for the resulting groups and a matrix factorization. Together with a method to compute correlation maps with minimum noise level, referred to as Missing-Data for Exploratory Data Analysis (MEDA), these three contributions constitute a complete matrix factorization framework. Two real examples are used to illustrate the approach and compare it with PCA, Sparse PCA and Structured Sparse PCA.
KW - Exploratory Data Analysis, Missing-Data, Sparsity, Matrix Factorization, Visualization
UR - https://doi.org/10.6084/m9.figshare.4288454
UR - https://doi.org/10.6084/m9.figshare.4288451
U2 - 10.1080/10618600.2016.1265527
DO - 10.1080/10618600.2016.1265527
M3 - Article
SN - 1061-8600
VL - 26
SP - 501
EP - 512
JO - Journal of Computational and Graphical Statistics
JF - Journal of Computational and Graphical Statistics
IS - 3
ER -