TY - JOUR
T1 - Autoencoder-based 3D representation learning for industrial seedling abnormality detection
AU - de Villiers, Hendrik A.C.
AU - Otten, Gerwoud
AU - Chauhan, Aneesh
AU - Meesters, Lydia
PY - 2023/3
Y1 - 2023/3
N2 - Industrial seedling quality assessment, such as attempting to find abnormal seedlings, is a challenging task where assessment methods must contend with the natural variability of seedlings, as well as the subjective nature of expert judgements. Furthermore, obtaining expert judgements is expensive and time-consuming, so machine learning approaches which rely on fewer judgements would be useful in practice. We investigate autoencoders, operating on 3D point clouds obtained from 6732 seedlings to address this challenge, exploiting such systems’ ability to work with partially labelled data. Point clouds from tomato seedlings are recorded using a 3D data capture platform, MARVIN™, and the quality of each seedling is determined by expert judgement. An existing system is used to establish baseline performance scores using a rule-based expert system and machine learning with handcrafted features. Autoencoders are trained on the point clouds to learn representations for subsequent use in classification. We examine scenarios where large amounts of partially labelled data are available, and compare with the case where fully labelled data is available. To improve performance, we compare the architectural subcomponents based on PointNet and PointNet++, as well as the effect of different training strategies. We find, with 13.6% of training data labelled, our model has correct classification rates of 97.7% and 82.7% for normals and abnormals respectively. With further improvements and fully labelled data, we find that correct classification rates of 97.6% and 96.1% can be reached. The results demonstrate that semi-supervised learning supported by partially labelled data has the potential to greatly reduce the cost of data curation, with minimal impact on overall accuracy.
AB - Industrial seedling quality assessment, such as attempting to find abnormal seedlings, is a challenging task where assessment methods must contend with the natural variability of seedlings, as well as the subjective nature of expert judgements. Furthermore, obtaining expert judgements is expensive and time-consuming, so machine learning approaches which rely on fewer judgements would be useful in practice. We investigate autoencoders, operating on 3D point clouds obtained from 6732 seedlings to address this challenge, exploiting such systems’ ability to work with partially labelled data. Point clouds from tomato seedlings are recorded using a 3D data capture platform, MARVIN™, and the quality of each seedling is determined by expert judgement. An existing system is used to establish baseline performance scores using a rule-based expert system and machine learning with handcrafted features. Autoencoders are trained on the point clouds to learn representations for subsequent use in classification. We examine scenarios where large amounts of partially labelled data are available, and compare with the case where fully labelled data is available. To improve performance, we compare the architectural subcomponents based on PointNet and PointNet++, as well as the effect of different training strategies. We find, with 13.6% of training data labelled, our model has correct classification rates of 97.7% and 82.7% for normals and abnormals respectively. With further improvements and fully labelled data, we find that correct classification rates of 97.6% and 96.1% can be reached. The results demonstrate that semi-supervised learning supported by partially labelled data has the potential to greatly reduce the cost of data curation, with minimal impact on overall accuracy.
KW - 3D
KW - Autoencoder
KW - Deep learning
KW - PointNet
KW - Seedling
KW - Usable transplant test
U2 - 10.1016/j.compag.2023.107619
DO - 10.1016/j.compag.2023.107619
M3 - Article
AN - SCOPUS:85147605211
SN - 0168-1699
VL - 206
JO - Computers and Electronics in Agriculture
JF - Computers and Electronics in Agriculture
M1 - 107619
ER -