Development and Validation of a Machine Learning Approach for Automated Severity Assessment of COVID-19 Based on Clinical and Imaging Data: Retrospective Study

Quiroz, Juan Carlos; Feng, You-Zhen; Cheng, Zhong-Yuan; Rezazadegan, Dana; Chen, Ping-Kang; Lin, Qi-Ting; Qian, Long; Liu, Xiao-Fang; Berkovsky, Shlomo; Coiera, Enrico; Song, Lei; Qiu, Xiaoming; Liu, Sidong; Cai, Xiang-Ran

doi:10.2196/24572

JMIR Medical Informatics (Feb 2021)

Development and Validation of a Machine Learning Approach for Automated Severity Assessment of COVID-19 Based on Clinical and Imaging Data: Retrospective Study

Quiroz, Juan Carlos,
Feng, You-Zhen,
Cheng, Zhong-Yuan,
Rezazadegan, Dana,
Chen, Ping-Kang,
Lin, Qi-Ting,
Qian, Long,
Liu, Xiao-Fang,
Berkovsky, Shlomo,
Coiera, Enrico,
Song, Lei,
Qiu, Xiaoming,
Liu, Sidong,
Cai, Xiang-Ran

Affiliations

Quiroz, Juan Carlos
Feng, You-Zhen
Cheng, Zhong-Yuan
Rezazadegan, Dana
Chen, Ping-Kang
Lin, Qi-Ting
Qian, Long
Liu, Xiao-Fang
Berkovsky, Shlomo
Coiera, Enrico
Song, Lei
Qiu, Xiaoming
Liu, Sidong
Cai, Xiang-Ran

DOI: https://doi.org/10.2196/24572
Journal volume & issue: Vol. 9, no. 2
p. e24572

Abstract

Read online

BackgroundCOVID-19 has overwhelmed health systems worldwide. It is important to identify severe cases as early as possible, such that resources can be mobilized and treatment can be escalated. ObjectiveThis study aims to develop a machine learning approach for automated severity assessment of COVID-19 based on clinical and imaging data. MethodsClinical data—including demographics, signs, symptoms, comorbidities, and blood test results—and chest computed tomography scans of 346 patients from 2 hospitals in the Hubei Province, China, were used to develop machine learning models for automated severity assessment in diagnosed COVID-19 cases. We compared the predictive power of the clinical and imaging data from multiple machine learning models and further explored the use of four oversampling methods to address the imbalanced classification issue. Features with the highest predictive power were identified using the Shapley Additive Explanations framework. ResultsImaging features had the strongest impact on the model output, while a combination of clinical and imaging features yielded the best performance overall. The identified predictive features were consistent with those reported previously. Although oversampling yielded mixed results, it achieved the best model performance in our study. Logistic regression models differentiating between mild and severe cases achieved the best performance for clinical features (area under the curve [AUC] 0.848; sensitivity 0.455; specificity 0.906), imaging features (AUC 0.926; sensitivity 0.818; specificity 0.901), and a combination of clinical and imaging features (AUC 0.950; sensitivity 0.764; specificity 0.919). The synthetic minority oversampling method further improved the performance of the model using combined features (AUC 0.960; sensitivity 0.845; specificity 0.929). ConclusionsClinical and imaging features can be used for automated severity assessment of COVID-19 and can potentially help triage patients with COVID-19 and prioritize care delivery to those at a higher risk of severe disease.

Published in JMIR Medical Informatics

ISSN: 2291-9694 (Online)
Publisher: JMIR Publications
Country of publisher: Canada
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: https://medinform.jmir.org

About the journal