IET Image Processing (Jan 2023)

FAFNet: Fully aligned fusion network for RGBD semantic segmentation based on hierarchical semantic flows

  • Jiazhou Chen,
  • Yangfan Zhan,
  • Yanghui Xu,
  • Xiang Pan

DOI
https://doi.org/10.1049/ipr2.12614
Journal volume & issue
Vol. 17, no. 1
pp. 32–41

Abstract


Depth maps provide readily acquirable and irreplaceable geometric information that significantly enhances traditional color images. RGB and Depth (RGBD) images have been widely used in various image analysis applications, but their effectiveness is still limited by the gap between the two modalities and the misalignment between color and depth. In this paper, a Fully Aligned Fusion Network (FAFNet) for RGBD semantic segmentation is presented. To improve cross‐modality fusion, a new RGBD fusion block is proposed: features from color images and depth maps are first fused by an attention cross fusion module and then aligned by a semantic flow. A multi‐layer structure is also designed to apply the RGBD fusion block hierarchically, which not only eases the problems of low resolution and noise in depth maps but also reduces the loss of semantic features during upsampling. Quantitative and qualitative evaluations on both the NYU‐Depth V2 and SUN RGB‐D datasets demonstrate that the FAFNet model outperforms state‐of‐the‐art RGBD semantic segmentation methods.
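
To make the fusion-block description concrete, below is a minimal PyTorch-style sketch of one such block, following only the abstract's outline: cross-modal attention fusion followed by semantic-flow alignment. The module names (AttentionCrossFusion, FlowAlign, RGBDFusionBlock), channel sizes, and all architectural details are illustrative assumptions, not the authors' implementation.

```python
# Illustrative sketch only; not the authors' code. Assumes PyTorch.
import torch
import torch.nn as nn
import torch.nn.functional as F


class AttentionCrossFusion(nn.Module):
    """Gate each modality with channel attention computed from the other (assumed design)."""
    def __init__(self, channels):
        super().__init__()
        self.rgb_gate = nn.Sequential(
            nn.AdaptiveAvgPool2d(1), nn.Conv2d(channels, channels, 1), nn.Sigmoid())
        self.depth_gate = nn.Sequential(
            nn.AdaptiveAvgPool2d(1), nn.Conv2d(channels, channels, 1), nn.Sigmoid())

    def forward(self, rgb_feat, depth_feat):
        # Cross attention: depth features reweight RGB channels and vice versa.
        return rgb_feat * self.depth_gate(depth_feat) + depth_feat * self.rgb_gate(rgb_feat)


class FlowAlign(nn.Module):
    """Predict a 2D semantic flow and warp the coarse feature onto the fine one."""
    def __init__(self, channels):
        super().__init__()
        self.flow = nn.Conv2d(channels * 2, 2, kernel_size=3, padding=1)

    def forward(self, fine, coarse):
        n, _, h, w = fine.shape
        coarse_up = F.interpolate(coarse, size=(h, w), mode='bilinear', align_corners=False)
        flow = self.flow(torch.cat([fine, coarse_up], dim=1))  # (N, 2, H, W) pixel offsets
        # Build a normalized sampling grid and shift it by the predicted flow.
        ys, xs = torch.meshgrid(
            torch.linspace(-1, 1, h, device=fine.device),
            torch.linspace(-1, 1, w, device=fine.device), indexing='ij')
        base = torch.stack((xs, ys), dim=-1).unsqueeze(0).expand(n, -1, -1, -1)
        offset = flow.permute(0, 2, 3, 1) / torch.tensor([w, h], device=fine.device)
        aligned = F.grid_sample(coarse_up, base + offset, align_corners=False)
        return fine + aligned


class RGBDFusionBlock(nn.Module):
    """Fuse RGB and depth features, then align with the coarser feature from a deeper stage."""
    def __init__(self, channels):
        super().__init__()
        self.fuse = AttentionCrossFusion(channels)
        self.align = FlowAlign(channels)

    def forward(self, rgb_feat, depth_feat, coarse_feat):
        fused = self.fuse(rgb_feat, depth_feat)
        return self.align(fused, coarse_feat)


if __name__ == "__main__":
    block = RGBDFusionBlock(channels=64)
    rgb = torch.randn(1, 64, 60, 80)
    depth = torch.randn(1, 64, 60, 80)
    coarse = torch.randn(1, 64, 30, 40)     # half-resolution feature from a deeper stage
    print(block(rgb, depth, coarse).shape)  # torch.Size([1, 64, 60, 80])
```

In a hierarchical (multi-layer) arrangement, one such block would be applied at each decoder stage, with the aligned output of a deeper stage serving as the coarse input of the next shallower one; this mirrors the abstract's claim of reducing semantic loss during upsampling, though the exact wiring in FAFNet is not specified here.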