PeerJ (Mar 2024)

Assessing the risk of concurrent mycoplasma pneumoniae pneumonia in children with tracheobronchial tuberculosis: retrospective study

  • Lin Liu,
  • Jie Jiang,
  • Lei Wu,
  • De miao Zeng,
  • Can Yan,
  • Linlong Liang,
  • Jiayun Shi,
  • Qifang Xie

DOI
https://doi.org/10.7717/peerj.17164
Journal volume & issue
Vol. 12
p. e17164

Abstract

Read online Read online

Objective This study aimed to create a predictive model based on machine learning to identify the risk for tracheobronchial tuberculosis (TBTB) occurring alongside Mycoplasma pneumoniae pneumonia in pediatric patients. Methods Clinical data from 212 pediatric patients were examined in this retrospective analysis. This cohort included 42 individuals diagnosed with TBTB and Mycoplasma pneumoniae pneumonia (combined group) and 170 patients diagnosed with lobar pneumonia alone (pneumonia group). Three predictive models, namely XGBoost, decision tree, and logistic regression, were constructed, and their performances were assessed using the receiver’s operating characteristic (ROC) curve, precision-recall curve (PR), and decision curve analysis (DCA). The dataset was divided into a 7:3 ratio to test the first and second groups, utilizing them to validate the XGBoost model and to construct the nomogram model. Results The XGBoost highlighted eight significant signatures, while the decision tree and logistic regression models identified six and five signatures, respectively. The ROC analysis revealed an area under the curve (AUC) of 0.996 for XGBoost, significantly outperforming the other models (p < 0.05). Similarly, the PR curve demonstrated the superior predictive capability of XGBoost. DCA further confirmed that XGBoost offered the highest AIC (43.226), the highest average net benefit (0.764), and the best model fit. Validation efforts confirmed the robustness of the findings, with the validation groups 1 and 2 showing ROC and PR curves with AUC of 0.997, indicating a high net benefit. The nomogram model was shown to possess significant clinical value. Conclusion Compared to machine learning approaches, the XGBoost model demonstrated superior predictive efficacy in identifying pediatric patients at risk of concurrent TBTB and Mycoplasma pneumoniae pneumonia. The model’s identification of critical signatures provides valuable insights into the pathogenesis of these conditions.

Keywords