Frontiers in Cellular and Infection Microbiology (Apr 2025)

Enhancing fever of unknown origin diagnosis: machine learning approaches to predict metagenomic next-generation sequencing positivity

  • Zhi Gao,
  • Zhi Gao,
  • Zhi Gao,
  • Yongfang Jiang,
  • Yongfang Jiang,
  • Yongfang Jiang,
  • Mengxuan Chen,
  • Mengxuan Chen,
  • Mengxuan Chen,
  • Weihang Wang,
  • Weihang Wang,
  • Weihang Wang,
  • Qiyao Liu,
  • Qiyao Liu,
  • Qiyao Liu,
  • Jing Ma,
  • Jing Ma,
  • Jing Ma

DOI
https://doi.org/10.3389/fcimb.2025.1550933
Journal volume & issue
Vol. 15

Abstract

Read online

ObjectiveMetagenomic next-generation sequencing (mNGS) can potentially detect various pathogenic microorganisms without bias to improve the diagnostic rate of fever of unknown origin (FUO), but there are no effective methods to predict mNGS-positive results. This study aimed to develop an interpretable machine learning algorithm for the effective prediction of mNGS results in patients with FUO.MethodsA clinical dataset from a large medical institution was used to develop and compare the performance of several predictive models, namely eXtreme Gradient Boosting (XGBoost), Light Gradient-Boosting Machine (LightGBM), and Random Forest, and the Shapley additive explanation (SHAP) method was employed to interpret and analyze the results.ResultsThe mNGS-positive rate among 284 patients with FUO reached 64.1%. Overall, the LightGBM-based model exhibited the best comprehensive predictive performance, with areas under the curve of 0.84 and 0.93 for the training and validation sets, respectively. Using the SHAP method, the five most important factors for predicting mNGS-positive results were albumin, procalcitonin, blood culture, disease type, and sample type.ConclusionThe validated LightGBM-based predictive model could have practical clinical value in enhancing the application of mNGS in the etiological diagnosis of FUO, representing a powerful tool to optimize the timing of mNGS.

Keywords