Transformative Transparent Hybrid Deep Learning Framework for Accurate Cataract Detection

Julius Olaniyan; Deborah Olaniyan; Ibidun Christiana Obagbuwa; Bukohwo Michael Esiefarienrhe; Matthew Odighi

doi:10.3390/app142110041

Applied Sciences (Nov 2024)

Affiliations

Julius Olaniyan: Department of Computer Science, Bowen University, Iwo PMB 284, Nigeria
Deborah Olaniyan: Department of Computer Science, Bowen University, Iwo PMB 284, Nigeria
Ibidun Christiana Obagbuwa: Department of Computer Science, Sol-Plaatje University, Kimberley 8301, South Africa
Bukohwo Michael Esiefarienrhe: Department of Computer Science, North-West University, Mafikeng X2046, South Africa
Matthew Odighi: Department of Computer Science, Ambrose Alli University, Ekpoma PMB 14, Nigeria

DOI: https://doi.org/10.3390/app142110041
Journal volume & issue: Vol. 14, no. 21, p. 10041

Abstract

This paper presents a transformative explainable convolutional neural network (CNN) framework for cataract detection, built on a hybrid deep learning model that combines Siamese networks with VGG16. By leveraging a learning rate scheduler and Grad-CAM (Gradient-weighted Class Activation Mapping) for explainability, the proposed model not only achieves high accuracy in identifying cataract-affected images but also provides interpretable visual explanations of its predictions. Performance evaluation metrics such as accuracy, precision, recall, and F1 score demonstrate the model’s robustness, with a reported accuracy of 100%. Grad-CAM visualizations highlight the key image regions, primarily around the iris and pupil, that contribute most to the model’s decision-making, making the system more transparent for clinical use. Additionally, novel statistical analysis methods, including saliency map evaluation metrics such as AUC (Area Under the Curve) and the Pointing Game, were employed to quantify the quality of the model’s explanations. These metrics enhance the interpretability of the model and support its practical applicability in medical image analysis. This approach advances the integration of deep learning with explainable AI, offering a robust, accurate, and interpretable solution for cataract detection with the potential for broader adoption in ocular disease diagnosis and medical decision support systems.
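
The Pointing Game mentioned above can be illustrated with a minimal sketch. It follows the common formulation of the metric: a saliency map scores a "hit" when its maximum-valued pixel falls inside an annotated region of interest, and the score is the fraction of hits. The abstract does not describe the authors' exact protocol, so the function names (`argmax_2d`, `pointing_game_accuracy`), the binary-mask input format, and the absence of any pixel tolerance are assumptions made purely for illustration.

```python
from typing import Sequence, Tuple

Grid = Sequence[Sequence[float]]


def argmax_2d(saliency: Grid) -> Tuple[int, int]:
    """Return (row, col) of the largest value in a 2D saliency map."""
    best_pos = (0, 0)
    best_val = saliency[0][0]
    for r, row in enumerate(saliency):
        for c, value in enumerate(row):
            if value > best_val:
                best_val = value
                best_pos = (r, c)
    return best_pos


def pointing_game_accuracy(saliency_maps: Sequence[Grid],
                           masks: Sequence[Grid]) -> float:
    """Fraction of maps whose peak saliency lies inside the region mask.

    Each mask is assumed (hypothetically) to be a binary grid of the same
    shape as its saliency map, with 1 marking the region of interest.
    """
    if not saliency_maps or len(saliency_maps) != len(masks):
        raise ValueError("need equal, non-zero numbers of maps and masks")
    hits = 0
    for saliency, mask in zip(saliency_maps, masks):
        r, c = argmax_2d(saliency)
        if mask[r][c]:
            hits += 1
    return hits / len(saliency_maps)


# Toy example: the first map peaks inside its mask, the second does not.
maps = [[[0.1, 0.2], [0.9, 0.3]],
        [[0.8, 0.1], [0.2, 0.3]]]
regions = [[[0, 0], [1, 0]],
           [[0, 1], [0, 0]]]
print(pointing_game_accuracy(maps, regions))  # prints 0.5
```

In practice the saliency maps would come from Grad-CAM and the masks from annotations of the iris and pupil region; both are replaced here by toy grids.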

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci
