PLoS ONE (Jan 2024)
A machine learning approach for rapid early detection of Campylobacter spp. using absorbance spectra collected from enrichment cultures.
Abstract
Enumeration of Campylobacter from environmental waters can be difficult due to its low concentrations, which can still pose a significant health risk. Spectrophotometry is an approach commonly used for fast detection of water-borne pollutants in water samples, but it has not been used for pathogen detection, which is commonly done through a laborious and time-consuming culture or qPCR Most Probable Number enumeration methods (i.e., MPN-PCR approaches). In this study, we proposed a new method, MPN-Spectro-ML, that can provide rapid evidence of Campylobacter detection and, hence, water concentrations. After an initial incubation, the samples were analysed using a spectrophotometer, and the spectrum data were used to train three machine learning (ML) models (i.e., supported vector machine - SVM, logistic regression-LR, and random forest-RF). The trained models were used to predict the presence of Campylobacter in the enriched water samples and estimate the most probable number (MPN). Over 100 stormwater, river, and creek samples (including both fresh and brackish water) from rural and urban catchments were collected to test the accuracy of the MPN-Spectro-ML method under various scenarios and compared to a previously standardised MPN-PCR method. Differences in the spectrum were found between positive and negative control samples, with two distinctive absorbance peaks between 540-542nm and 575-576nm for positive samples. Further, the three ML models had similar performance irrespective of the scenario tested with average prediction accuracy (ACC) and false negative rates at 0.763 and 13.8%, respectively. However, the predicted MPN of Campylobacter from the new method varied from the traditional MPN-PCR method, with a maximum Nash-Sutcliffe coefficient of 0.44 for the urban catchment dataset. Nevertheless, the MPN values based on these two methods were still comparable, considering the confidence intervals and large uncertainties associated with MPN estimation. The study reveals the potential of this novel approach for providing interim evidence of the presence and levels of Campylobacter within environmental water bodies. This, in turn, decreases the time from risk detection to management for the benefit of public health.