Diagnostics (Nov 2022)

Prognostic Model of COVID-19 Severity and Survival among Hospitalized Patients Using Machine Learning Techniques

  • Ivano Lodato,
  • Aditya Varna Iyer,
  • Isaac Zachary To,
  • Zhong-Yuan Lai,
  • Helen Shuk-Ying Chan,
  • Winnie Suk-Wai Leung,
  • Tommy Hing-Cheung Tang,
  • Victor Kai-Lam Cheung,
  • Tak-Chiu Wu,
  • George Wing-Yiu Ng

DOI
https://doi.org/10.3390/diagnostics12112728
Journal volume & issue
Vol. 12, no. 11
p. 2728

Abstract

Read online

We conducted a statistical study and developed a machine learning model to triage COVID-19 patients affected during the height of the COVID-19 pandemic in Hong Kong based on their medical records and test results (features) collected during their hospitalization. The correlation between the values of these features is studied against discharge status and disease severity as a preliminary step to identify those features with a more pronounced effect on the patient outcome. Once identified, they constitute the inputs of four machine learning models, Decision Tree, Random Forest, Gradient and RUSBoosting, which predict both the Mortality and Severity associated with the disease. We test the accuracy of the models when the number of input features is varied, demonstrating their stability; i.e., the models are already highly predictive when run over a core set of (6) features. We show that Random Forest and Gradient Boosting classifiers are highly accurate in predicting patients’ Mortality (average accuracy ∼99%) as well as categorize patients (average accuracy ∼91%) into four distinct risk classes (Severity of COVID-19 infection). Our methodical and broad approach combines statistical insights with various machine learning models, which paves the way forward in the AI-assisted triage and prognosis of COVID-19 cases, which is potentially generalizable to other seasonal flus.

Keywords