BMC Medical Informatics and Decision Making (Mar 2023)

Machine learning prediction for COVID-19 disease severity at hospital admission

  • Ganesh Raman,
  • Bilal Ashraf,
  • Yusuf Kemal Demir,
  • Corey D. Kershaw,
  • Sreekanth Cheruku,
  • Murat Atis,
  • Ahsen Atis,
  • Mustafa Atar,
  • Weina Chen,
  • Ibrahim Ibrahim,
  • Taha Bat,
  • Mutlu Mete

DOI
https://doi.org/10.1186/s12911-023-02132-4
Journal volume & issue
Vol. 23, no. 1
pp. 1 – 6

Abstract

Read online

Abstract Importance Early prognostication of patients hospitalized with COVID-19 who may require mechanical ventilation and have worse outcomes within 30 days of admission is useful for delivering appropriate clinical care and optimizing resource allocation. Objective To develop machine learning models to predict COVID-19 severity at the time of the hospital admission based on a single institution data. Design, setting, and participants We established a retrospective cohort of patients with COVID-19 from University of Texas Southwestern Medical Center from May 2020 to March 2022. Easily accessible objective markers including basic laboratory variables and initial respiratory status were assessed using Random Forest’s feature importance score to create a predictive risk score. Twenty-five significant variables were identified to be used in classification models. The best predictive models were selected with repeated tenfold cross-validation methods. Main outcomes and measures Among patients with COVID-19 admitted to the hospital, severity was defined by 30-day mortality (30DM) rates and need for mechanical ventilation. Results This was a large, single institution COVID-19 cohort including total of 1795 patients. The average age was 59.7 years old with diverse heterogeneity. 236 (13%) required mechanical ventilation and 156 patients (8.6%) died within 30 days of hospitalization. Predictive accuracy of each predictive model was validated with the 10-CV method. Random Forest classifier for 30DM model had 192 sub-trees, and obtained 0.72 sensitivity and 0.78 specificity, and 0.82 AUC. The model used to predict MV has 64 sub-trees and returned obtained 0.75 sensitivity and 0.75 specificity, and 0.81 AUC. Our scoring tool can be accessed at https://faculty.tamuc.edu/mmete/covid-risk.html . Conclusions and relevance In this study, we developed a risk score based on objective variables of COVID-19 patients within six hours of admission to the hospital, therefore helping predict a patient's risk of developing critical illness secondary to COVID-19.

Keywords