Digital Health (Nov 2022)

Homogeneous ensemble models for predicting infection levels and mortality of COVID-19 patients: Evidence from China

  • Jiafeng Wang,
  • Xianlong Zhou,
  • Zhitian Hou,
  • Xiaoya Xu,
  • Yueyue Zhao,
  • Shanshan Chen,
  • Jun Zhang,
  • Lina Shao,
  • Rong Yan,
  • Mingshan Wang,
  • Minghua Ge,
  • Tianyong Hao,
  • Yuexing Tu,
  • Haijun Huang

DOI
https://doi.org/10.1177/20552076221133692
Journal volume & issue
Vol. 8

Abstract

Read online

Background Persistence of long-term COVID-19 pandemic is putting high pressure on healthcare services worldwide for several years. This article aims to establish models to predict infection levels and mortality of COVID-19 patients in China. Methods Machine learning models and deep learning models have been built based on the clinical features of COVID-19 patients. The best models are selected by area under the receiver operating characteristic curve (AUC) scores to construct two homogeneous ensemble models for predicting infection levels and mortality, respectively. The first-hand clinical data of 760 patients are collected from Zhongnan Hospital of Wuhan University between 3 January and 8 March 2020. We preprocess data with cleaning, imputation, and normalization. Results Our models obtain AUC = 0.7059 and Recall (Weighted avg) = 0.7248 in predicting infection level, while AUC=0.8436 and Recall (Weighted avg) = 0.8486 in predicting mortality ratio. This study also identifies two sets of essential clinical features. One is C-reactive protein (CRP) or high sensitivity C-reactive protein (hs-CRP) and the other is chest tightness, age, and pleural effusion. Conclusions Two homogeneous ensemble models are proposed to predict infection levels and mortality of COVID-19 patients in China. New findings of clinical features for benefiting the machine learning models are reported. The evaluation of an actual dataset collected from January 3 to March 8, 2020 demonstrates the effectiveness of the models by comparing them with state-of-the-art models in prediction.