Scientific Reports (Dec 2022)
Non-targeted detection of food adulteration using an ensemble machine-learning model
Abstract
Abstract Recurrent incidents of economically motivated adulteration have long-lasting and devastating effects on public health, economy, and society. With the current food authentication methods being target-oriented, the lack of an effective methodology to detect unencountered adulterants can lead to the next melamine-like outbreak. In this study, an ensemble machine-learning model that can help detect unprecedented adulteration without looking for specific substances, that is, in a non-targeted approach, is proposed. Using raw milk as an example, the proposed model achieved an accuracy and F1 score of 0.9924 and 0. 0.9913, respectively, when the same type of adulterants was presented in the training data. Cross-validation with spiked contaminants not routinely tested in the food industry and blinded from the training data provided an F1 score of 0.8657. This is the first study that demonstrates the feasibility of non-targeted detection with no a priori knowledge of the presence of certain adulterants using data from standard industrial testing as input. By uncovering discriminative profiling patterns, the ensemble machine-learning model can monitor and flag suspicious samples; this technique can potentially be extended to other food commodities and thus become an important contributor to public food safety.