IEEE Access (Jan 2023)
Wireless Capsule Endoscopy Image Classification: An Explainable AI Approach
Abstract
Deep Learning (DL) has contributed significantly to advances in the fields of Medical Imaging and Computer-Aided Diagnosis (CAD). Although a variety of DL models exist for image classification in the medical domain, their decision-making processes require further analysis. For this reason, several novel Explainable AI (XAI) techniques have been proposed in recent years to better understand DL models. Currently, medical professionals rely on visual inspection to diagnose potential diseases in endoscopic imaging during the preliminary stages. However, we believe that automated systems can enhance the efficiency of such diagnoses. The aim of this study is to increase the reliability of model predictions in the field of endoscopic imaging by implementing several transfer learning models on a balanced subset of Kvasir-Capsule, a Wireless Capsule Endoscopy imaging dataset. This subset comprises the top 9 classes of the dataset for training and testing. The Vision Transformer model achieved an F1-score of 97% ± 1%, while other models such as MobileNetV3Large and ResNet152V2 also achieved F1-scores above 90%. These are currently the highest reported metrics on this data, improving upon prior studies conducted on the same dataset. Heatmaps produced by several XAI techniques, including GradCAM, GradCAM++, LayerCAM, LIME, and SHAP, are presented in image form and evaluated according to their highlighted regions of importance, in an effort to better understand the decisions of the top-performing DL models and look beyond their black-box nature.
Keywords