TY - JOUR
T1 - An automated deep learning pipeline based on advanced optimisations for leveraging spectral classification modelling
AU - Passos, Dário
AU - Mishra, Puneet
PY - 2021/8/15
Y1 - 2021/8/15
N2 - In deep learning (DL) modelling for spectral data, a major challenge is related to the choice of DL network architecture and the selection of the best hyperparameters. Often, slight changes to the neural architecture or its hyperparameter can have a direct influence on the model's performance, making its robustness questionable. To deal with it, this study presents an automated deep learning modelling based on advanced optimisation techniques involving Hyperband and Bayesian optimisation, to automatically find optimal neural architecture and its hyperparameters to reach robust DL models. The optimisation requires a base neural architecture to be initialized, however, later it automatically adjusts the neural architecture and the hyperparameters to reach the optimal model. Furthermore, to support the interpretation of the DL models, a wavelength weighing schema based on gradient-weighted class activation mapping (Grad-CAM) was implemented. The potential of the approach was showed on a real case of wheat variety classification with near-infrared spectral data. The performance of the classification was compared with that previously reported on the same dataset with different DL and chemometric approaches. The results showed that with the proposed approach a classification accuracy of 94.9% was reached, which was better than the best reported accuracy on the same data set i.e., 93%. Furthermore, the better performance was obtained with a simpler neural architecture compared to what was used in earlier studies. The automated deep learning based on advanced optimisation can support DL modelling of spectral data.
AB - In deep learning (DL) modelling for spectral data, a major challenge is related to the choice of DL network architecture and the selection of the best hyperparameters. Often, slight changes to the neural architecture or its hyperparameter can have a direct influence on the model's performance, making its robustness questionable. To deal with it, this study presents an automated deep learning modelling based on advanced optimisation techniques involving Hyperband and Bayesian optimisation, to automatically find optimal neural architecture and its hyperparameters to reach robust DL models. The optimisation requires a base neural architecture to be initialized, however, later it automatically adjusts the neural architecture and the hyperparameters to reach the optimal model. Furthermore, to support the interpretation of the DL models, a wavelength weighing schema based on gradient-weighted class activation mapping (Grad-CAM) was implemented. The potential of the approach was showed on a real case of wheat variety classification with near-infrared spectral data. The performance of the classification was compared with that previously reported on the same dataset with different DL and chemometric approaches. The results showed that with the proposed approach a classification accuracy of 94.9% was reached, which was better than the best reported accuracy on the same data set i.e., 93%. Furthermore, the better performance was obtained with a simpler neural architecture compared to what was used in earlier studies. The automated deep learning based on advanced optimisation can support DL modelling of spectral data.
KW - artificial Intelligence
KW - Crop
KW - Phenotyping
KW - Spectroscopy
U2 - 10.1016/j.chemolab.2021.104354
DO - 10.1016/j.chemolab.2021.104354
M3 - Article
AN - SCOPUS:85107695931
SN - 0169-7439
VL - 215
JO - Chemometrics and Intelligent Laboratory Systems
JF - Chemometrics and Intelligent Laboratory Systems
M1 - 104354
ER -