MixSegNet: A Novel Crack Segmentation Network Combining CNN and Transformer

Yang Zhou; Raza Ali; Norrima Mokhtar; Sulaiman Wadi Harun; Masahiro Iwahashi

doi:10.1109/ACCESS.2024.3438112

IEEE Access (Jan 2024)

MixSegNet: A Novel Crack Segmentation Network Combining CNN and Transformer

Yang Zhou,
Raza Ali,
Norrima Mokhtar,
Sulaiman Wadi Harun,
Masahiro Iwahashi

Affiliations

Yang Zhou: ORCiD; Department of Electrical Engineering, Faculty of Engineering, Universiti Malaya, Kuala Lumpur, Malaysia
Raza Ali: ORCiD; Department of Electrical Engineering, Faculty of Information and Communication Technology (FICT), Balochistan University of Information Technology, Engineering and Management Sciences (BUITEMS), Quetta, Pakistan
Norrima Mokhtar: ORCiD; Department of Electrical Engineering, Faculty of Engineering, Universiti Malaya, Kuala Lumpur, Malaysia
Sulaiman Wadi Harun: ORCiD; Department of Electrical Engineering, Faculty of Engineering, Universiti Malaya, Kuala Lumpur, Malaysia
Masahiro Iwahashi: ORCiD; Department of Electrical, Electronics and Information Engineering, Nagaoka University of Technology, Nagaoka, Japan

DOI: https://doi.org/10.1109/ACCESS.2024.3438112
Journal volume & issue: Vol. 12
pp. 111535 – 111545

Abstract

Read online

In the domain of road inspection and structural health monitoring, precise crack identification and segmentation are essential for structural safety and disaster prediction. Traditional image processing technologies encounter difficulties in detecting cracks due to their morphological diversity and complex background noise. This results in low detection accuracy and poor generalization. To overcome these challenges, this paper introduces MixSegNet, a novel deep learning model that enhances crack recognition and segmentation by integrating multi-scale features and deep feature learning. MixSegNet integrates convolutional neural networks (CNNs) and transformer architectures to enhance the detection of small cracks through the extraction and fusion of fine-grained features. Comparative evaluations against mainstream models, including LRASPP, U-Net, Deeplabv3, Swin-UNet, AttuNet, and FCN, demonstrate that MixSegNet achieves superior performance on open-source datasets. Specifically, the model achieved a precision of 95.2%, a recall of 88.2%, an F1 score of 91.5%, and a mean intersection over union (mIoU) of 84.8%, thereby demonstrating its effectiveness and reliability for crack segmentation tasks.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords