IEEE Access (Jan 2024)

Mpox Virus Image Segmentation Based on Multiscale Expansion Convolution

  • Jian-Fei Ma,
  • Peng-Fei He,
  • Cheng-Lin Li,
  • Rong Nie

DOI
https://doi.org/10.1109/ACCESS.2024.3448364
Journal volume & issue
Vol. 12
pp. 117608 – 117616

Abstract

Read online

A novel strategy for segmenting mpox virus images is proposed to address the challenge of distinguishing lesion areas from muscle tissue and other regions of infection. This strategy leverages a multiscale inflated convolutional feature fusion and attentional Swin-Unet approach. In this method, a multi-scale extended convolution module is employed in the coding stage of the Swin-Unet network to enhance complementary features while preserving different features at different scales. Additionally, a triple attention module is integrated into the downsampling process to address the issue of inter-channel independence. Finally, the Swin Transformer Block is utilized to modulate the network segmentation performance by adjusting the iteration count in the encoding and decoding regions. Experimental results on a self-constructed mpox dataset demonstrate that the proposed network achieves a pixel segmentation accuracy of 90.4% and an average intersection-over-union ratio of 80.3%. These values represent improvements of 8.6% and 14.6%, respectively, compared to the original Swin-Unet. This enhancement provides valuable support for the ancillary diagnosis of mpox.

Keywords