Non-targeted detection of food adulteration using an ensemble machine-learning model

Teresa Chung; Issan Yee San Tam; Nelly Yan Yan Lam; Yanni Yang; Boyang Liu; Billy He; Wengen Li; Jie Xu; Zhigang Yang; Lei Zhang; Jian Nong Cao; Lok-Ting Lau

doi:10.1038/s41598-022-25452-3

Scientific Reports (Dec 2022)

Non-targeted detection of food adulteration using an ensemble machine-learning model

Teresa Chung,
Issan Yee San Tam,
Nelly Yan Yan Lam,
Yanni Yang,
Boyang Liu,
Billy He,
Wengen Li,
Jie Xu,
Zhigang Yang,
Lei Zhang,
Jian Nong Cao,
Lok-Ting Lau

Affiliations

Teresa Chung: Department of Industrial and Systems Engineering, The Hong Kong Polytechnic University
Issan Yee San Tam: Research and Innovation Office, The Hong Kong Polytechnic University
Nelly Yan Yan Lam: Institute for Innovation, Translation and Policy Research, Hong Kong Baptist University
Yanni Yang: Department of Computing, The Hong Kong Polytechnic University
Boyang Liu: Inner Mongolia Mengniu Dairy (Group) Co., Ltd
Billy He: Department of Computing, The Hong Kong Polytechnic University
Wengen Li: Department of Computing, The Hong Kong Polytechnic University
Jie Xu: Danone Open Science Research Center
Zhigang Yang: Inner Mongolia Mengniu Dairy (Group) Co., Ltd
Lei Zhang: Department of Computing, The Hong Kong Polytechnic University
Jian Nong Cao: Department of Computing, The Hong Kong Polytechnic University
Lok-Ting Lau: Department of Industrial and Systems Engineering, The Hong Kong Polytechnic University

DOI: https://doi.org/10.1038/s41598-022-25452-3
Journal volume & issue: Vol. 12, no. 1
pp. 1 – 15

Abstract

Read online

Abstract Recurrent incidents of economically motivated adulteration have long-lasting and devastating effects on public health, economy, and society. With the current food authentication methods being target-oriented, the lack of an effective methodology to detect unencountered adulterants can lead to the next melamine-like outbreak. In this study, an ensemble machine-learning model that can help detect unprecedented adulteration without looking for specific substances, that is, in a non-targeted approach, is proposed. Using raw milk as an example, the proposed model achieved an accuracy and F1 score of 0.9924 and 0. 0.9913, respectively, when the same type of adulterants was presented in the training data. Cross-validation with spiked contaminants not routinely tested in the food industry and blinded from the training data provided an F1 score of 0.8657. This is the first study that demonstrates the feasibility of non-targeted detection with no a priori knowledge of the presence of certain adulterants using data from standard industrial testing as input. By uncovering discriminative profiling patterns, the ensemble machine-learning model can monitor and flag suspicious samples; this technique can potentially be extended to other food commodities and thus become an important contributor to public food safety.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal