Construction of a Risk Prediction Model for Hospital-Acquired Pulmonary Embolism in Hospitalized Patients

Lengchen Hou PhD; Longjun Hu PhD; Wenxue Gao MD; Wenbo Sheng PhD; Zedong Hao PhD; Yiwei Chen MPH; Jiyu Li PhD

doi:10.1177/10760296211040868

Clinical and Applied Thrombosis/Hemostasis (Sep 2021)

Construction of a Risk Prediction Model for Hospital-Acquired Pulmonary Embolism in Hospitalized Patients

Lengchen Hou PhD,
Longjun Hu PhD,
Wenxue Gao MD,
Wenbo Sheng PhD,
Zedong Hao PhD,
Yiwei Chen MPH,
Jiyu Li PhD

Affiliations

Lengchen Hou PhD: *As co-first authors, the two authors have an equally important contribution to this research.
Longjun Hu PhD: *As co-first authors, the two authors have an equally important contribution to this research.
Wenxue Gao MD: Shanghai Tenth People's Hospital, Shanghai, China
Wenbo Sheng PhD: Shanghai Synyi Medical Technology Co., Ltd, Shanghai, China
Zedong Hao PhD: Shanghai Synyi Medical Technology Co., Ltd, Shanghai, China
Yiwei Chen MPH: Shanghai Synyi Medical Technology Co., Ltd, Shanghai, China
Jiyu Li PhD: Shanghai Tenth People's Hospital, Shanghai, China

DOI: https://doi.org/10.1177/10760296211040868
Journal volume & issue: Vol. 27

Abstract

Read online

The purpose of this study is to establish a novel pulmonary embolism (PE) risk prediction model based on machine learning (ML) methods and to evaluate the predictive performance of the model and the contribution of variables to the predictive performance. We conducted a retrospective study at the Shanghai Tenth People's Hospital and collected the clinical data of in-patients that received pulmonary computed tomography imaging between January 1, 2014 and December 31, 2018. We trained several ML models, including logistic regression (LR), support vector machine (SVM), random forest (RF), and gradient boosting decision tree (GBDT), compared the models with representative baseline algorithms, and investigated their predictability and feature interpretation. A total of 3619 patients were included in the study. We discovered that the GBDT model demonstrated the best prediction with an area under the curve value of 0.799, whereas those of the RF, LR, and SVM models were 0.791, 0.716, and 0.743, respectively. The sensibilities of the GBDT, LR, RF, and SVM models were 63.9%, 68.1%, 71.5%, and 75%, respectively; the specificities were 81.1%, 66.1, 72.7%, and 65.1%, respectively; and the accuracies were 77.8%, 66.5%, 72.5%, and 67%, respectively. We discovered that the maximum D-dimer level contributed the most to the outcome prediction, followed by the extreme growth rate of the plasma fibrinogen level, in-hospital duration, and extreme growth rate of the D-dimer level. The study demonstrates the superiority of the GBDT model in predicting the risk of PE in hospitalized patients. However, in order to be applied in clinical practice and provide support for clinical decision-making, the predictive performance of the model needs to be prospectively verified.

Published in Clinical and Applied Thrombosis/Hemostasis

ISSN: 1076-0296 (Print); 1938-2723 (Online)
Publisher: SAGE Publishing
Country of publisher: United States
LCC subjects: Medicine: Internal medicine: Specialties of internal medicine: Diseases of the circulatory (Cardiovascular) system
Website: https://journals.sagepub.com/home/cat

About the journal