CSFFNet: Lightweight cross‐scale feature fusion network for salient object detection in remote sensing images

Longbao Wang; Chong Long; Xin Li; Xiaodan Tang; Zhipeng Bai; Hongmin Gao

doi:10.1049/ipr2.12972

IET Image Processing (Feb 2024)

CSFFNet: Lightweight cross‐scale feature fusion network for salient object detection in remote sensing images

Longbao Wang,
Chong Long,
Xin Li,
Xiaodan Tang,
Zhipeng Bai,
Hongmin Gao

Affiliations

Longbao Wang: School of Computer and Information Hohai University Nanjing China
Chong Long: School of Computer and Information Hohai University Nanjing China
Xin Li: School of Computer and Information Hohai University Nanjing China
Xiaodan Tang: China Yangtze Power Co., Ltd Beijing China
Zhipeng Bai: China Yangtze Power Co., Ltd Beijing China
Hongmin Gao: School of Computer and Information Hohai University Nanjing China

DOI: https://doi.org/10.1049/ipr2.12972
Journal volume & issue: Vol. 18, no. 3
pp. 602 – 614

Abstract

Read online

Abstract Salient object detection (SOD), one of the most important applications in the field of computer vision, aims to extract the most visually appealing regions of scenes. However, the improvement of the accuracy of existing salient object detection in optical remote sensing images (ORSI‐SOD) is usually accompanied by an increase of network complexity, which affects the application of these models. Motivated by this, a novel lightweight edge‐supervised neural network for ORSI‐SOD is proposed, named CSFFNet. Specifically, the backbone (ResNet34) is first lightened by feature encoding module (FEM), building a lightweight subnet for feature extraction. Then, in the transformer‐based feature pyramid enhancement module (FPEM), the convolutional features obtained in the FEM are enhanced by long‐distance dependence to obtain multi‐scale features containing rich saliency cues. Based on this, the feature fusion module (FFM) is designed to capture cross‐scale long‐range dependencies and effectively fuse high‐level semantic information with low‐level detail information. Thus, the increase in network complexity due to multi‐level decoding is avoided. Finally, the segmentation results are optimized by using salient edges as auxiliary information, which effectively improves the contrast and completeness of the results. Experimental results on two public datasets demonstrate that the lightweight CSFFNet achieves competitive or even better performance compared with state‐of‐the‐art methods.

Published in IET Image Processing

ISSN: 1751-9659 (Print); 1751-9667 (Online)
Publisher: Wiley
Country of publisher: United Kingdom
LCC subjects: Technology: Photography; Science: Mathematics: Instruments and machines: Electronic computers. Computer science: Computer software
Website: https://ietresearch.onlinelibrary.wiley.com/journal/17519667

About the journal

Abstract

Keywords