Remote Sensing (Sep 2024)

SFA-Net: Semantic Feature Adjustment Network for Remote Sensing Image Segmentation

  • Gyutae Hwang
  • Jiwoo Jeong
  • Sang Jun Lee

DOI: https://doi.org/10.3390/rs16173278
Journal volume & issue: Vol. 16, no. 17, p. 3278

Abstract

Advances in deep learning and computer vision techniques have had an impact on the field of remote sensing, enabling efficient data analysis for applications such as land cover classification and change detection. Convolutional neural networks (CNNs) and transformer architectures have been widely used in visual perception algorithms owing to their effectiveness in analyzing local features and global context, respectively. In this paper, we propose a hybrid transformer architecture that consists of a CNN-based encoder and a transformer-based decoder. We propose a feature adjustment module that refines the multiscale feature maps extracted from an EfficientNet backbone network, and the adjusted feature maps are integrated into the transformer-based decoder to perform semantic segmentation of remote sensing images. This paper refers to the proposed encoder–decoder architecture as the semantic feature adjustment network (SFA-Net). To demonstrate the effectiveness of SFA-Net, thorough experiments were conducted on four public benchmark datasets: UAVid, ISPRS Potsdam, ISPRS Vaihingen, and LoveDA. The proposed model achieved state-of-the-art segmentation accuracy on the UAVid, ISPRS Vaihingen, and LoveDA datasets. On the ISPRS Potsdam dataset, our method achieved accuracy comparable to the latest model while reducing the number of trainable parameters from 113.8 M to 10.7 M.
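The abstract describes a hybrid design: a CNN encoder producing multiscale features, a feature adjustment module that refines them, and a transformer-based decoder that fuses them into segmentation logits. The following is a minimal PyTorch sketch of how such a pipeline could be wired together; the adjustment block, channel widths, backbone interface, and all module names here are illustrative assumptions rather than the paper's actual implementation.

```python
# Minimal sketch of a CNN-encoder / transformer-decoder segmentation network
# in the spirit of SFA-Net. All design details below are assumptions made for
# illustration; the published architecture may differ substantially.
import torch
import torch.nn as nn
import torch.nn.functional as F


class FeatureAdjustment(nn.Module):
    """Hypothetical adjustment block: projects a backbone feature map to a
    common width and re-weights it with simple channel attention."""
    def __init__(self, in_ch: int, out_ch: int):
        super().__init__()
        self.proj = nn.Conv2d(in_ch, out_ch, kernel_size=1)
        self.gate = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(out_ch, out_ch, kernel_size=1),
            nn.Sigmoid(),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x = self.proj(x)
        return x * self.gate(x)


class TransformerDecoderHead(nn.Module):
    """Fuses the adjusted multiscale features, applies self-attention over the
    fused tokens, and predicts per-pixel class logits."""
    def __init__(self, embed_dim: int, num_classes: int, num_layers: int = 2):
        super().__init__()
        layer = nn.TransformerEncoderLayer(d_model=embed_dim, nhead=4,
                                           batch_first=True)
        self.transformer = nn.TransformerEncoder(layer, num_layers=num_layers)
        self.classifier = nn.Conv2d(embed_dim, num_classes, kernel_size=1)

    def forward(self, feats, out_size):
        # Upsample every scale to the largest resolution and fuse by summation.
        target = feats[0].shape[-2:]
        fused = sum(F.interpolate(f, size=target, mode="bilinear",
                                  align_corners=False) for f in feats)
        b, c, h, w = fused.shape
        tokens = fused.flatten(2).transpose(1, 2)        # (B, H*W, C)
        tokens = self.transformer(tokens)
        fused = tokens.transpose(1, 2).reshape(b, c, h, w)
        logits = self.classifier(fused)
        return F.interpolate(logits, size=out_size, mode="bilinear",
                             align_corners=False)


class SFANetSketch(nn.Module):
    """Ties the pieces together. The EfficientNet backbone is left as an
    external component that supplies multiscale feature maps."""
    def __init__(self, backbone_channels=(40, 112, 320),
                 embed_dim: int = 64, num_classes: int = 6):
        super().__init__()
        self.adjust = nn.ModuleList(
            FeatureAdjustment(c, embed_dim) for c in backbone_channels)
        self.decoder = TransformerDecoderHead(embed_dim, num_classes)

    def forward(self, image, backbone_feats):
        feats = [adj(f) for adj, f in zip(self.adjust, backbone_feats)]
        return self.decoder(feats, out_size=image.shape[-2:])


if __name__ == "__main__":
    # Dummy multiscale features standing in for an EfficientNet backbone.
    img = torch.randn(1, 3, 256, 256)
    feats = [torch.randn(1, 40, 32, 32),
             torch.randn(1, 112, 16, 16),
             torch.randn(1, 320, 8, 8)]
    out = SFANetSketch()(img, feats)
    print(out.shape)  # torch.Size([1, 6, 256, 256])
```

The channel widths (40, 112, 320) roughly mirror intermediate EfficientNet stages but are placeholders; the point of the sketch is only the data flow from multiscale CNN features through per-scale adjustment into a token-based decoder.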

Keywords