PLoS ONE (Jan 2022)

MFEAFN: Multi-scale feature enhanced adaptive fusion network for image semantic segmentation.

  • Shusheng Li,
  • Liang Wan,
  • Lu Tang,
  • Zhining Zhang

DOI
https://doi.org/10.1371/journal.pone.0274249
Journal volume & issue
Vol. 17, no. 9
p. e0274249

Abstract

Read online

Low-level features contain spatial detail information, and high-level features contain rich semantic information. Semantic segmentation research focuses on fully acquiring and effectively fusing spatial detail with semantic information. This paper proposes a multiscale feature-enhanced adaptive fusion network named MFEAFN to improve semantic segmentation performance. First, we designed a Double Spatial Pyramid Module named DSPM to extract more high-level semantic information. Second, we designed a Focusing Selective Fusion Module named FSFM to fuse different scales and levels of feature maps. Specifically, the feature maps are enhanced to adaptively fuse these features by generating attention weights through a spatial attention mechanism and a two-dimensional discrete cosine transform, respectively. To validate the effectiveness of FSFM, we designed different fusion modules for comparison and ablation experiments. MFEAFN achieved 82.64% and 78.46% mIoU on the PASCAL VOC2012 and Cityscapes datasets. In addition, our method has better segmentation results than state-of-the-art methods.