Critical Care Explorations (Jul 2024)

Development and Validation of a Machine Learning COVID-19 Veteran (COVet) Deterioration Risk Score

  • Sushant Govindan, MD, MSc,
  • Alexandra Spicer, MS,
  • Matthew Bearce, MPS,
  • Richard S. Schaefer, PharmD,
  • Andrea Uhl, BA,
  • Gil Alterovitz, PhD,
  • Michael J. Kim, MD,
  • Kyle A. Carey, MPH,
  • Nirav S. Shah, MD, MPH,
  • Christopher Winslow, MD,
  • Emily Gilbert, MD,
  • Anne Stey, MD, MSc,
  • Alan M. Weiss, MD,
  • Devendra Amin, MD,
  • George Karway, PhD,
  • Jennie Martin, MS,
  • Dana P. Edelson, MD,
  • Matthew M. Churpek, MD, MPH, PhD

DOI
https://doi.org/10.1097/CCE.0000000000001116
Journal volume & issue
Vol. 6, no. 7
p. e1116

Abstract

Read online

BACKGROUND AND OBJECTIVE:. To develop the COVid Veteran (COVet) score for clinical deterioration in Veterans hospitalized with COVID-19 and further validate this model in both Veteran and non-Veteran samples. No such score has been derived and validated while incorporating a Veteran sample. DERIVATION COHORT:. Adults (age ≥ 18 yr) hospitalized outside the ICU with a diagnosis of COVID-19 for model development to the Veterans Health Administration (VHA) (n = 80 hospitals). VALIDATION COHORT:. External validation occurred in a VHA cohort of 34 hospitals, as well as six non-Veteran health systems for further external validation (n = 21 hospitals) between 2020 and 2023. PREDICTION MODEL:. eXtreme Gradient Boosting machine learning methods were used, and performance was assessed using the area under the receiver operating characteristic curve and compared with the National Early Warning Score (NEWS). The primary outcome was transfer to the ICU or death within 24 hours of each new variable observation. Model predictor variables included demographics, vital signs, structured flowsheet data, and laboratory values. RESULTS:. A total of 96,908 admissions occurred during the study period, of which 59,897 were in the Veteran sample and 37,011 were in the non-Veteran sample. During external validation in the Veteran sample, the model demonstrated excellent discrimination, with an area under the receiver operating characteristic curve of 0.88. This was significantly higher than NEWS (0.79; p < 0.01). In the non-Veteran sample, the model also demonstrated excellent discrimination (0.86 vs. 0.79 for NEWS; p < 0.01). The top three variables of importance were eosinophil percentage, mean oxygen saturation in the prior 24-hour period, and worst mental status in the prior 24-hour period. CONCLUSIONS:. We used machine learning methods to develop and validate a highly accurate early warning score in both Veterans and non-Veterans hospitalized with COVID-19. The model could lead to earlier identification and therapy, which may improve outcomes.