Computational and Structural Biotechnology Journal (Jan 2021)

Eleven routine clinical features predict COVID-19 severity uncovered by machine learning of longitudinal measurements

  • Kai Zhou,
  • Yaoting Sun,
  • Lu Li,
  • Zelin Zang,
  • Jing Wang,
  • Jun Li,
  • Junbo Liang,
  • Fangfei Zhang,
  • Qiushi Zhang,
  • Weigang Ge,
  • Hao Chen,
  • Xindong Sun,
  • Liang Yue,
  • Xiaomai Wu,
  • Bo Shen,
  • Jiaqin Xu,
  • Hongguo Zhu,
  • Shiyong Chen,
  • Hai Yang,
  • Shigao Huang,
  • Minfei Peng,
  • Dongqing Lv,
  • Chao Zhang,
  • Haihong Zhao,
  • Luxiao Hong,
  • Zhehan Zhou,
  • Haixiao Chen,
  • Xuejun Dong,
  • Chunyu Tu,
  • Minghui Li,
  • Yi Zhu,
  • Baofu Chen,
  • Stan Z. Li,
  • Tiannan Guo

Journal volume & issue
Vol. 19
pp. 3640 – 3649

Abstract

Read online

Severity prediction of COVID-19 remains one of the major clinical challenges for the ongoing pandemic. Here, we have recruited a 144 COVID-19 patient cohort, resulting in a data matrix containing 3,065 readings for 124 types of measurements over 52 days. A machine learning model was established to predict the disease progression based on the cohort consisting of training, validation, and internal test sets. A panel of eleven routine clinical factors constructed a classifier for COVID-19 severity prediction, achieving accuracy of over 98% in the discovery set. Validation of the model in an independent cohort containing 25 patients achieved accuracy of 80%. The overall sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were 0.70, 0.99, 0.93, and 0.93, respectively. Our model captured predictive dynamics of lactate dehydrogenase (LDH) and creatine kinase (CK) while their levels were in the normal range. This model is accessible at https://www.guomics.com/covidAI/ for research purpose.

Keywords