Improved Colorectal Polyp Segmentation Using Enhanced MA-NET and Modified Mix-ViT Transformer

Khaled Elkarazle; Valliappan Raman; Patrick Then; Caslon Chua

doi:10.1109/ACCESS.2023.3291783

IEEE Access (Jan 2023)

Improved Colorectal Polyp Segmentation Using Enhanced MA-NET and Modified Mix-ViT Transformer

Khaled Elkarazle,
Valliappan Raman,
Patrick Then,
Caslon Chua

Affiliations

Khaled Elkarazle: ORCiD; Faculty of Engineering, Computing and Science, Swinburne University of Technology, Sarawak Campus, Kuching, Sarawak, Malaysia
Valliappan Raman: Department of Artificial Intelligence and Data Science, Coimbatore Institute of Technology, Coimbatore, Tamil Nadu, India
Patrick Then: ORCiD; Faculty of Engineering, Computing and Science, Swinburne University of Technology, Sarawak Campus, Kuching, Sarawak, Malaysia
Caslon Chua: Faculty of Science, Engineering and Technology, Swinburne University of Technology, Melbourne, VIC, Australia

DOI: https://doi.org/10.1109/ACCESS.2023.3291783
Journal volume & issue: Vol. 11
pp. 69295 – 69309

Abstract

Read online

Colorectal polyps is a prevalent medical condition that could lead to colorectal cancer, a leading cause of cancer-related mortality globally, if left undiagnosed. Colonoscopy remains the gold standard for detection and diagnosis of colorectal neoplasia; however, a significant proportion of neoplastic lesions are missed during routine examinations, particularly diminutive and flat lesions. Deep learning techniques have been employed to improve polyp detection rates in colonoscopy images and have proven successful in reducing the miss rate. However, accurate segmentation of small and flat polyps remains a major challenge to existing models as they struggle to differentiate polypoid and non-polypoid regions apart. To address this issue, we present an enhanced version of the Multi-Scale Attention Network (MA-NET) that incorporates a modified Mix-ViT transformer as the feature extractor. The modified Mix-ViT facilitates ultra-fine-grained visual categorization to improve the segmentation accuracy of polypoid and non-polypoid regions. Additionally, we introduce a pre-processing layer that performs histogram equalization on input images in the CIEL $^{\ast} \text{A}^{\ast} \text{B}^{\ast} $ color space to enhance their features. Our model was trained on a combined dataset comprising Kvasir-SEG and CVC-ClinicDB and cross-validated on CVC-ColonDB and ETIS-LaribDB. The proposed method demonstrates superior performance compared to existing methods, particularly in the detection of small and flat polyps.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords