CTH-Net: A CNN and Transformer hybrid network for skin lesion segmentation
Yuhan Ding,
Zhenglin Yi,
Jiatong Xiao,
Minghui Hu,
Yu Guo,
Zhifang Liao,
Yongjie Wang
Affiliations
Yuhan Ding
School of Computer Science and Engineering, Central South University, Changsha 410083, China
Zhenglin Yi
Departments of Urology, Xiangya Hospital, Central South University, Changsha 410008, China
Jiatong Xiao
Departments of Urology, Xiangya Hospital, Central South University, Changsha 410008, China
Minghui Hu
Departments of Urology, Xiangya Hospital, Central South University, Changsha 410008, China
Yu Guo
Department of Burns and Plastic Surgery, Xiangya Hospital, Central South University, Changsha 410008, China; National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha 410008, China
Zhifang Liao
School of Computer Science and Engineering, Central South University, Changsha 410083, China; Corresponding author
Yongjie Wang
Department of Burns and Plastic Surgery, Xiangya Hospital, Central South University, Changsha 410008, China; National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha 410008, China; Corresponding author
Summary: Automatically and accurately segmenting skin lesions can be challenging, due to factors such as low contrast and fuzzy boundaries. This paper proposes a hybrid encoder-decoder model (CTH-Net) based on convolutional neural network (CNN) and Transformer, capitalizing on the advantages of these approaches. We propose three modules for skin lesion segmentation and seamlessly connect them with carefully designed model architecture. Better segmentation performance is achieved by introducing SoftPool in the CNN branch and sandglass block in the bottleneck layer. Extensive experiments were conducted on four publicly accessible skin lesion datasets, ISIC 2016, ISIC 2017, ISIC 2018, and PH2 to confirm the efficacy and benefits of the proposed strategy. Experimental results show that the proposed CTH-Net provides better skin lesion segmentation performance in both quantitative and qualitative testing when compared with state-of-the-art approaches. We believe the CTH-Net design is inspiring and can be extended to other applications/frameworks.