npj Biofilms and Microbiomes (Sep 2024)
Integrating respiratory microbiome and host immune response through machine learning for respiratory tract infection diagnosis
Abstract
Abstract At present, the diagnosis of lower respiratory tract infections (LRTIs) is difficult, and there is an urgent need for better diagnostic methods. This study enrolled 136 patients from 2020 to 2021 and collected bronchoalveolar lavage fluid (BALF) specimens. We used metatranscriptome to analyze the lower respiratory tract microbiome (LRTM) and host immune response. The diversity of the LRTM in LRTIs significantly decreased, manifested by a decrease in the abundance of normal microbiota and an increase in the abundance of opportunistic pathogens. The upregulated differentially expressed genes (DEGs) in the LRTIs group were mainly enriched in infection immune response-related pathways. Klebsiella pneumoniae had the most significant increase in abundance in LRTIs, which was strongly correlated with host infection or inflammation genes TNFRSF1B, CSF3R, and IL6R. We combined LRTM and host transcriptome data to construct a machine-learning model with 12 screened features to discriminate LRTIs and non-LRTIs. The results showed that the model trained by Random Forest in the validate set had the best performance (ROC AUC: 0.937, 95% CI: 0.832–1). The independent external dataset showed an accuracy of 76.5% for this model. This study suggests that the model integrating LRTM and host transcriptome data can be an effective tool for LRTIs diagnosis.