TY - JOUR
T1 - New data preprocessing trends based on ensemble of multiple preprocessing techniques
AU - Mishra, Puneet
AU - Biancolillo, Alessandra
AU - Roger, Jean Michel
AU - Marini, Federico
AU - Rutledge, Douglas N.
PY - 2020/11
Y1 - 2020/11
N2 - Data generated by analytical instruments, such as spectrometers, may contain unwanted variation due to measurement mode, sample state and other external physical, chemical and environmental factors. Preprocessing is required so that the property of interest can be predicted correctly. Different correction methods may remove specific types of artefacts while still leaving some effects behind. Using multiple preprocessing in a complementary way can remove the artefacts that would be left behind by using only one technique. This article summarizes the recent developments in new data preprocessing strategies and specifically reviews the emerging ensemble approaches to preprocessing fusion in chemometrics. A demonstration case is also presented. In summary, ensemble preprocessing allows the selection of several techniques and their combinations that, in a complementary way, lead to improved models. Ensemble approaches are not limited to spectral data but can be used in all cases where preprocessing is needed and identification of a single best option is not easily done.
AB - Data generated by analytical instruments, such as spectrometers, may contain unwanted variation due to measurement mode, sample state and other external physical, chemical and environmental factors. Preprocessing is required so that the property of interest can be predicted correctly. Different correction methods may remove specific types of artefacts while still leaving some effects behind. Using multiple preprocessing in a complementary way can remove the artefacts that would be left behind by using only one technique. This article summarizes the recent developments in new data preprocessing strategies and specifically reviews the emerging ensemble approaches to preprocessing fusion in chemometrics. A demonstration case is also presented. In summary, ensemble preprocessing allows the selection of several techniques and their combinations that, in a complementary way, lead to improved models. Ensemble approaches are not limited to spectral data but can be used in all cases where preprocessing is needed and identification of a single best option is not easily done.
KW - Chemometrics
KW - Ensemble learning
KW - Multi-block analysis
KW - Multivariate calibration
KW - Preprocessing
U2 - 10.1016/j.trac.2020.116045
DO - 10.1016/j.trac.2020.116045
M3 - Article
AN - SCOPUS:85092044433
SN - 0165-9936
VL - 132
JO - TrAC : Trends in Analytical Chemistry
JF - TrAC : Trends in Analytical Chemistry
M1 - 116045
ER -