TY - JOUR
T1 - Sequentially orthogonalized canonical partial least squares for improved multiple responses modeling in multiblock data sets
AU - Mishra, Puneet
PY - 2023/4/15
Y1 - 2023/4/15
N2 - Multiblock data sets and modeling techniques are widely encountered in the chemometric community. Although the currently available techniques, such as sequential orthogonalized partial least squares (SO-PLS) regression are mainly focused on the prediction of a single response and deal with the multiple response(s) case using PLS2 type approach. Recently, a new approach called canonical PLS (CPLS) was proposed for extracting the subspaces efficiently for multiple response(s) cases, supporting both regression and classification. 'Efficiently' here means more information in fewer latent variables. This work suggests a combination of SO-PLS and CPLS, sequential orthogonalized canonical partial least squares (SO-CPLS), to model multiple response(s) for multiblock data sets. The cases of SO-CPLS for modeling multiple response(s) regression and classification were demonstrated on several data sets. Also, the capability of SO-CPLS to incorporate meta-information related to samples for efficient subspace extraction is demonstrated. Furthermore, a comparison with the commonly used sequential modeling technique, called sequential orthogonalized partial least squares (SO-PLS), is also presented. The SO-CPLS approach can benefit both the multiple response(s) regression and classification modeling and can be of high importance when meta-information such as experimental design or sample classes is available.
AB - Multiblock data sets and modeling techniques are widely encountered in the chemometric community. Although the currently available techniques, such as sequential orthogonalized partial least squares (SO-PLS) regression are mainly focused on the prediction of a single response and deal with the multiple response(s) case using PLS2 type approach. Recently, a new approach called canonical PLS (CPLS) was proposed for extracting the subspaces efficiently for multiple response(s) cases, supporting both regression and classification. 'Efficiently' here means more information in fewer latent variables. This work suggests a combination of SO-PLS and CPLS, sequential orthogonalized canonical partial least squares (SO-CPLS), to model multiple response(s) for multiblock data sets. The cases of SO-CPLS for modeling multiple response(s) regression and classification were demonstrated on several data sets. Also, the capability of SO-CPLS to incorporate meta-information related to samples for efficient subspace extraction is demonstrated. Furthermore, a comparison with the commonly used sequential modeling technique, called sequential orthogonalized partial least squares (SO-PLS), is also presented. The SO-CPLS approach can benefit both the multiple response(s) regression and classification modeling and can be of high importance when meta-information such as experimental design or sample classes is available.
KW - Complementary
KW - Data Fusion
KW - Multiple responses
KW - Multivariate
U2 - 10.1016/j.aca.2023.340957
DO - 10.1016/j.aca.2023.340957
M3 - Article
C2 - 36898815
AN - SCOPUS:85150008814
SN - 0003-2670
VL - 1250
SP - 340957
JO - Analytica Chimica Acta
JF - Analytica Chimica Acta
ER -