CSC-Unet: A Novel Convolutional Sparse Coding Strategy Based Neural Network for Semantic Segmentation

Haitong Tang; Shuang He; Mengduo Yang; Xia Lu; Qin Yu; Kaiyue Liu; Hongjie Yan; Nizhuan Wang

doi:10.1109/ACCESS.2024.3373619

IEEE Access (Jan 2024)

CSC-Unet: A Novel Convolutional Sparse Coding Strategy Based Neural Network for Semantic Segmentation

Haitong Tang,
Shuang He,
Mengduo Yang,
Xia Lu,
Qin Yu,
Kaiyue Liu,
Hongjie Yan,
Nizhuan Wang

Affiliations

Haitong Tang: School of Geomatics and Marine Information, Jiangsu Ocean University, Lianyungang, China
Shuang He: School of Geomatics and Marine Information, Jiangsu Ocean University, Lianyungang, China
Mengduo Yang: School of Information Technology, Suzhou Institute of Trade and Commerce, Suzhou, China
Xia Lu: School of Geography Science and Geomatics Engineering, Suzhou University of Science and Technology, Suzhou, China
Qin Yu: School of Computer Engineering, Jiangsu Ocean University, Lianyungang, China
Kaiyue Liu: School of Geomatics and Marine Information, Jiangsu Ocean University, Lianyungang, China
Hongjie Yan: Department of Neurology, Affiliated Lianyungang Hospital of Xuzhou Medical University, Lianyungang, China
Nizhuan Wang: ORCiD; School of Geomatics and Marine Information, Jiangsu Ocean University, Lianyungang, China

DOI: https://doi.org/10.1109/ACCESS.2024.3373619
Journal volume & issue: Vol. 12
pp. 35844 – 35854

Abstract

Read online

It is still a challenging task to perform the semantic segmentation with high accuracy due to the complexity of real picture scenes. Many semantic segmentation methods based on traditional deep learning insufficiently captured the semantic and appearance information of images, which put limit on their generality and robustness for various application scenes. Thus, in this paper, we proposed a novel strategy that reformulated the popularly used convolution operation to multi-layer convolutional sparse coding block in semantic segmentation method to ease the aforementioned deficiency. To prove the effectiveness of our idea, we chose the widely used U-Net model for the demonstration purpose, and we designed CSC-Unet model series based on U-Net. Through extensive analysis and experiments, we provided credible evidence showing that the multi-layer convolutional sparse coding block enables semantic segmentation model to converge faster, extract finer semantic and appearance information of images, and improve the ability to recover spatial detail information. The best CSC-Unet model significantly outperforms the results of the original U-Net on three public datasets with different scenarios, i.e., 87.14% vs. 84.71% on DeepCrack dataset, 68.91% vs. 67.09% on Nuclei dataset, and 53.68% vs. 48.82% on CamVid dataset, respectively. In addition, the proposed strategy could be possibly used to significantly improve segmentation performance of any semantic segmentation model that involves convolution operations and the corresponding code is available at https://github.com/NZWANG/CSC-Unet.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords