Frontiers in Microbiology (Apr 2024)

Rapid identification of lactic acid bacteria at species/subspecies level via ensemble learning of Ramanomes

  • Yan Ren,
  • Yan Ren,
  • Yang Zheng,
  • Xiaojing Wang,
  • Shuang Qu,
  • Lijun Sun,
  • Chenyong Song,
  • Jia Ding,
  • Yuetong Ji,
  • Yuetong Ji,
  • Guoze Wang,
  • Guoze Wang,
  • Pengfei Zhu,
  • Pengfei Zhu,
  • Likun Cheng,
  • Likun Cheng

DOI
https://doi.org/10.3389/fmicb.2024.1361180
Journal volume & issue
Vol. 15

Abstract

Read online

Rapid and accurate identification of lactic acid bacteria (LAB) species would greatly improve the screening rate for functional LAB. Although many conventional and molecular methods have proven efficient and reliable, LAB identification using these methods has generally been slow and tedious. Single-cell Raman spectroscopy (SCRS) provides the phenotypic profile of a single cell and can be performed by Raman spectroscopy (which directly detects vibrations of chemical bonds through inelastic scattering by a laser light) using an individual live cell. Recently, owing to its affordability, non-invasiveness, and label-free features, the Ramanome has emerged as a potential technique for fast bacterial detection. Here, we established a reference Ramanome database consisting of SCRS data from 1,650 cells from nine LAB species/subspecies and conducted further analysis using machine learning approaches, which have high efficiency and accuracy. We chose the ensemble meta-classifier (EMC), which is suitable for solving multi-classification problems, to perform in-depth mining and analysis of the Ramanome data. To optimize the accuracy and efficiency of the machine learning algorithm, we compared nine classifiers: LDA, SVM, RF, XGBoost, KNN, PLS-DA, CNN, LSTM, and EMC. EMC achieved the highest average prediction accuracy of 97.3% for recognizing LAB at the species/subspecies level. In summary, Ramanomes, with the integration of EMC, have promising potential for fast LAB species/subspecies identification in laboratories and may thus be further developed and sharpened for the direct identification and prediction of LAB species from fermented food.

Keywords