Scientific Reports (Nov 2021)

An interpretable machine learning model based on a quick pre-screening system enables accurate deterioration risk prediction for COVID-19

  • Lijing Jia,
  • Zijian Wei,
  • Heng Zhang,
  • Jiaming Wang,
  • Ruiqi Jia,
  • Manhong Zhou,
  • Xueyan Li,
  • Hankun Zhang,
  • Xuedong Chen,
  • Zheyuan Yu,
  • Zhaohong Wang,
  • Xiucheng Li,
  • Tingting Li,
  • Xiangge Liu,
  • Pei Liu,
  • Wei Chen,
  • Jing Li,
  • Kunlun He

DOI
https://doi.org/10.1038/s41598-021-02370-4
Journal volume & issue
Vol. 11, no. 1
pp. 1 – 16

Abstract

Read online

Abstract A high-performing interpretable model is proposed to predict the risk of deterioration in coronavirus disease 2019 (COVID-19) patients. The model was developed using a cohort of 3028 patients diagnosed with COVID-19 and exhibiting common clinical symptoms that were internally verified (AUC 0.8517, 95% CI 0.8433, 0.8601). A total of 15 high risk factors for deterioration and their approximate warning ranges were identified. This included prothrombin time (PT), prothrombin activity, lactate dehydrogenase, international normalized ratio, heart rate, body-mass index (BMI), D-dimer, creatine kinase, hematocrit, urine specific gravity, magnesium, globulin, activated partial thromboplastin time, lymphocyte count (L%), and platelet count. Four of these indicators (PT, heart rate, BMI, HCT) and comorbidities were selected for a streamlined combination of indicators to produce faster results. The resulting model showed good predictive performance (AUC 0.7941 95% CI 0.7926, 0.8151). A website for quick pre-screening online was also developed as part of the study.