Microbiology Spectrum (Oct 2024)
Classification of Latilactobacillus sakei subspecies based on MALDI-TOF MS protein profiles using machine learning models
Abstract
ABSTRACT Latilactobacillus sakei is an important bacterial species used as a starter culture for fermented foods; however, two subspecies within this species exhibit different properties in the foods. Matrix-assisted laser desorption/ionization-time of flight mass spectrometer (MALDI-TOF MS) is the gold standard for microbial fingerprinting. However, the resolution power is down to the species level. This study was to combine MALDI-TOF mass spectra and machine learning to develop a new method to identify two L. sakei subspecies (L. sakei subsp. sakei and L. sakei subsp. carnosus) and non-L. sakei species. Totally, 227 strains were collected, with 908 spectra obtained via on- and off-plate protein extraction. Only 68.7% of strains were correctly identified at the subspecies level in the Biotyper database; however, a high level of performance was observed from the machine learning models. Partial least squares-discriminant analysis (PLS-DA), principal component analysis-K-nearest neighbor (PCA-KNN), and support vector machine (SVM) demonstrated 0.823, 0.914, and 0.903 accuracies, respectively, whereas the random forest (RF) achieved an accuracy of 0.954, with an area under the receiver operating characteristic (AUROC) curve of 0.99, outperforming the other algorithms in distinguishing the subspecies. The machine learning proved to be a promising technique for the rapid and high-resolution classification of L. sakei subspecies using MALDI-TOF MS.IMPORTANCELatilactobacillus sakei plays a significant role in the realm of food bacteria. One particular subspecies of L. sakei is employed as a protective agent during food fermentation, whereas another strain is responsible for food spoilage. Hence, it is crucial to precisely differentiate between the two subspecies of L. sakei. In this study, machine learning models based on protein mass peaks were developed for the first time to distinguish L. sakei subspecies. Furthermore, the efficacy of three commonly used machine learning algorithms for microbial classification was evaluated. Our results provide the foundation for future research on developing machine learning models for the classification of microbial species or subspecies. In addition, the developed model can be used in the food industry to monitor L. sakei subspecies in fermented foods in a time- and cost-effective method for food quality and safety.
Keywords