Prediction of 3-Year All-Cause Death in a Percutaneous Coronary Intervention Registry using Machine Learning: A Comparison Between Random Forest and CatBoost Algorithms

Paul-Adrian CĂLBUREAN; Paul GREBENIŞAN; Victor VACARIU; Reka-Katalin DRINCAL; Oana ŢEPES; Iulia GRANCEA; Ioana ŞUŞ; Cristina SOMKEREKI; Valentin SIMON; Zoltán DEMJÉN; István ADORJÁN; Irina PINITILIE; Anca Teodora DOLCOŞ; Tiberiu OLTEAN; László HADADI; Marius MĂRUŞTERI

Applied Medical Informatics (Sep 2021)

Prediction of 3-Year All-Cause Death in a Percutaneous Coronary Intervention Registry using Machine Learning: A Comparison Between Random Forest and CatBoost Algorithms

Paul-Adrian CĂLBUREAN,
Paul GREBENIŞAN,
Victor VACARIU,
Reka-Katalin DRINCAL,
Oana ŢEPES,
Iulia GRANCEA,
Ioana ŞUŞ,
Cristina SOMKEREKI,
Valentin SIMON,
Zoltán DEMJÉN,
István ADORJÁN,
Irina PINITILIE,
Anca Teodora DOLCOŞ,
Tiberiu OLTEAN,
László HADADI,
Marius MĂRUŞTERI

Affiliations

Paul-Adrian CĂLBUREAN: UMFST George Emil Palade Targu Mures
Paul GREBENIŞAN: Emergency Institute for Cardiovascular Diseases and Transplantation Târgu Mureş
Victor VACARIU: Emergency Institute for Cardiovascular Diseases and Transplantation Târgu Mureş
Reka-Katalin DRINCAL: Emergency Institute for Cardiovascular Diseases and Transplantation Târgu Mureş
Oana ŢEPES: Emergency Institute for Cardiovascular Diseases and Transplantation Târgu Mureş
Iulia GRANCEA: Emergency Institute for Cardiovascular Diseases and Transplantation Târgu Mureş
Ioana ŞUŞ: Emergency Institute for Cardiovascular Diseases and Transplantation Târgu Mureş
Cristina SOMKEREKI: Emergency Institute for Cardiovascular Diseases and Transplantation Târgu Mureş
Valentin SIMON: Emergency Institute for Cardiovascular Diseases and Transplantation Târgu Mureş
Zoltán DEMJÉN: Emergency Institute for Cardiovascular Diseases and Transplantation Târgu Mureş
István ADORJÁN: Emergency Institute for Cardiovascular Diseases and Transplantation Târgu Mureş
Irina PINITILIE: Emergency Institute for Cardiovascular Diseases and Transplantation Târgu Mureş
Anca Teodora DOLCOŞ: Emergency Institute for Cardiovascular Diseases and Transplantation Târgu Mureş
Tiberiu OLTEAN: Emergency Institute for Cardiovascular Diseases and Transplantation Târgu Mureş
László HADADI: Emergency Institute for Cardiovascular Diseases and Transplantation Târgu Mureş
Marius MĂRUŞTERI: UMFST George Emil Palade Targu Mures

Journal volume & issue: Vol. 43, no. Suppl. S1
pp. 21 – 21

Abstract

Read online

Background and Aim: Risk stratification in patients undergoing percutaneous coronary intervention (PCI) procedures is a major objective in clinical practice since it guides appropriate therapy selection. Machine learning (ML) models are complex automated decision systems and numerous algorithms have been developed. Our aim was to compare random forest, a traditional ML algorithm, with gradient boosting with categorical features support (CatBoost), a newer ML algorithm, in predicting 3-year all-cause death in a PCI population. Materials and Methods: All patients older than 18 years and treated by PCI in a tertiary care centre between January 2016 – December 2017 have been included after hospital discharge in this registry. Mortality rates at 3 years were documented from the Romanian National Health Insurance System database. A total of 120 clinical variables were used to train the two ML algorithms. Training was performed on 70% of the dataset and testing was performed on the remaining 30% of the dataset. Results: A total of 2242 patients were included, of which 336 (14.9%) were deceased at 3-year follow-up. Area under receiver-operator curve for 3-year all-cause mortality prediction for CatBoost was 0.848, while for random forest was 0.802 (DeLong p=0.001). Three most important clinical variables for both ML models were age, left ventricular ejection fraction and serum creatinine. Brier scores for random forest and CatBoost were 0.121 and 0.102 respectively, indicating a good fit of the ML-based models. Conclusions: Among aggregated decision trees ML algorithms, CatBoost has superior predictive capacity of adverse clinical events in a PCI population when compared with random forest.

Published in Applied Medical Informatics

ISSN: 1224-5593 (Print); 2067-7855 (Online)
Publisher: Iuliu Hatieganu University of Medicine and Pharmacy, Cluj-Napoca
Country of publisher: Romania
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: https://ami.info.umfcluj.ro/index.php/AMI

About the journal

Abstract

Keywords