Automatic classification of literature in systematic reviews on food safety using machine learning

Leonieke M. van den Bulk, Yamine Bouzembrak*, Anand Gavai, Ningjing Liu, Lukas J. van den Heuvel, Hans J.P. Marvin

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

Systematic reviews are used to collect relevant literature to answer a research question in a way that is clear, thorough, unbiased and reproducible. They are implemented as a standard method in the domain of food safety to obtain a literature overview on the state-of-the-art research related to food safety topics of interest. A disadvantage to systematic reviews, however, is that this process is time-consuming and requires expert domain knowledge. The work reported here aims to reduce the time needed by an expert to screen all possible relevant articles by applying machine learning techniques to classify the articles automatically as either relevant or not relevant. Eight different machine learning algorithms and ensembles of all combinations of these algorithms were tested on two different systematic reviews on food safety (i.e. chemical hazards in cereals and leafy greens). The results showed that the best performance was obtained by an ensemble of naive Bayes and a support vector machine, resulting in an average decrease of 32.8% in the amount of articles the expert has to read and an average decrease in irrelevant articles of 57.8% while keeping 95% of the relevant articles. It was concluded that automatic classification of the literature in a systematic literature review can support experts in their task and save valuable time without compromising the quality of the review.

Original languageEnglish
Pages (from-to)84-95
JournalCurrent Research in Food Science
Volume5
DOIs
Publication statusPublished - Jan 2022

Keywords

  • Artificial intelligence
  • Classification models
  • Document screening
  • Food safety hazards
  • Literature reviews
  • Text mining

Fingerprint

Dive into the research topics of 'Automatic classification of literature in systematic reviews on food safety using machine learning'. Together they form a unique fingerprint.

Cite this