Applied Sciences (Jul 2023)
Uncovering the Black Box of Coronary Artery Disease Diagnosis: The Significance of Explainability in Predictive Models
Abstract
In recent times, coronary artery disease (CAD) prediction and diagnosis have been the subject of many medical decision support systems (MDSS) that make use of machine learning (ML) and deep learning (DL) algorithms. The common ground of most of these applications is that they function as black boxes: they reach a conclusion/diagnosis using multiple features as input, yet the user is often unaware of the prediction process and of the feature weights that lead to the eventual prediction. The primary objective of this study is to enhance the transparency and comprehensibility of a black-box prediction model designed for CAD. The dataset employed in this research comprises biometric and clinical information obtained from 571 patients, encompassing 21 different features. CAD was confirmed in 43% of the cases through invasive coronary angiography (ICA). Furthermore, a prediction model built on this dataset with the CatBoost algorithm is analyzed to highlight its prediction-making process and the significance of each input datum. State-of-the-art explainability mechanisms are employed to highlight the significance of each feature, and common patterns and divergences from the medical literature are then discussed. Moreover, the findings are compared with common risk factors for CAD to offer an evaluation of the prediction process from the medical expert’s point of view. By depicting how the algorithm weights the information contained in the features, we shed light on the black-box mechanics of ML prediction models; by analyzing the findings, we explore their validity against the medical literature on the matter.
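To make the described pipeline concrete, the following is a minimal sketch of training a CatBoost classifier on tabular patient data and inspecting per-feature contributions with SHAP. The file name, label column, and hyperparameters are illustrative assumptions, not the authors' exact setup, and SHAP is assumed here as the explainability mechanism since the abstract does not name one.

```python
# Minimal sketch (assumed setup): CatBoost classifier on CAD tabular data,
# explained with SHAP values. Column names and parameters are hypothetical.
import pandas as pd
import shap
from catboost import CatBoostClassifier
from sklearn.model_selection import train_test_split

# Hypothetical dataset: 571 rows, 21 clinical/biometric features, binary CAD label.
df = pd.read_csv("cad_dataset.csv")            # assumed file name
X, y = df.drop(columns=["CAD"]), df["CAD"]     # assumed label column

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=42
)

model = CatBoostClassifier(iterations=500, depth=6, verbose=False)
model.fit(X_train, y_train, eval_set=(X_test, y_test))

# TreeExplainer produces one SHAP value per feature per prediction, showing
# how each input datum pushes the model toward or away from a CAD diagnosis.
explainer = shap.TreeExplainer(model)
shap_values = explainer.shap_values(X_test)
shap.summary_plot(shap_values, X_test)         # global feature-importance view
```

The summary plot ranks features by their mean absolute SHAP value, which is the kind of feature-significance view that can then be compared against established CAD risk factors.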
Keywords