Remote Sensing (Jul 2024)
MFFnet: Multimodal Feature Fusion Network for Synthetic Aperture Radar and Optical Image Land Cover Classification
Abstract
Optical and Synthetic Aperture Radar (SAR) imagery offers a wealth of complementary information about a given target owing to their distinct imaging mechanisms. Multimodal remote sensing data have therefore been widely used to improve land cover classification. However, fully integrating optical and SAR image data is not straightforward because their features follow distinct distributions. To this end, we propose a land cover classification network based on multimodal feature fusion, termed MFFnet. We adopt a dual-stream network to extract features from SAR and optical images: a ResNet extracts deep features from optical images, while PidiNet extracts edge features from SAR images. The iAFF feature fusion module enables interactions between the two modalities at both the low and high feature levels. Additionally, to strengthen global feature dependencies, the ASPP module processes the interactions between high-level features. The processed high-level features from the dual-stream encoder are fused with low-level features and fed into the decoder, which restores the spatial resolution of the feature maps and generates the predicted classification maps. Comprehensive evaluations demonstrate that MFFnet achieves excellent performance in both qualitative and quantitative assessments on the WHU-OPT-SAR dataset. Compared to the second-best method, MFFnet improves overall accuracy (OA) and the Kappa coefficient by 7.7% and 11.26%, respectively.
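The abstract describes the architecture only at a high level; as a rough illustration, the following is a minimal PyTorch sketch of a dual-stream encoder with iAFF-style attentional fusion, an ASPP block over the fused high-level features, and a simple decoder. The stand-in convolutional encoders, channel widths, simplified module internals, input channel counts, and the class count are illustrative assumptions, not the authors' implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class iAFF(nn.Module):
    """Simplified iterative attentional feature fusion: gates two
    same-shaped feature maps with channel attention, twice."""
    def __init__(self, channels, r=4):
        super().__init__()
        mid = max(channels // r, 1)
        def attn():
            return nn.Sequential(
                nn.AdaptiveAvgPool2d(1),
                nn.Conv2d(channels, mid, 1), nn.ReLU(inplace=True),
                nn.Conv2d(mid, channels, 1), nn.Sigmoid())
        self.attn1, self.attn2 = attn(), attn()

    def forward(self, x, y):
        w = self.attn1(x + y)
        fused = x * w + y * (1 - w)        # first fusion pass
        w2 = self.attn2(fused)
        return x * w2 + y * (1 - w2)       # refined (iterative) pass


class ASPP(nn.Module):
    """Atrous spatial pyramid pooling: parallel dilated convolutions
    capture multi-scale context over the fused high-level features."""
    def __init__(self, in_ch, out_ch, rates=(1, 6, 12, 18)):
        super().__init__()
        self.branches = nn.ModuleList(
            nn.Conv2d(in_ch, out_ch, 3, padding=r, dilation=r) for r in rates)
        self.project = nn.Conv2d(out_ch * len(rates), out_ch, 1)

    def forward(self, x):
        return self.project(torch.cat([F.relu(b(x)) for b in self.branches], 1))


class MFFnet(nn.Module):
    """Minimal sketch of the dual-stream fusion network described above.
    Simple conv stacks stand in for the ResNet (optical) and PidiNet
    (SAR edge) encoders; n_classes=7 assumes the WHU-OPT-SAR categories."""
    def __init__(self, n_classes=7, ch=64):
        super().__init__()
        self.opt_low  = nn.Sequential(nn.Conv2d(4, ch, 3, 2, 1), nn.ReLU())  # 4-band optical (assumption)
        self.opt_high = nn.Sequential(nn.Conv2d(ch, ch * 4, 3, 2, 1), nn.ReLU())
        self.sar_low  = nn.Sequential(nn.Conv2d(1, ch, 3, 2, 1), nn.ReLU())  # single-band SAR (assumption)
        self.sar_high = nn.Sequential(nn.Conv2d(ch, ch * 4, 3, 2, 1), nn.ReLU())
        self.fuse_low, self.fuse_high = iAFF(ch), iAFF(ch * 4)
        self.aspp = ASPP(ch * 4, ch)
        self.head = nn.Conv2d(ch * 2, n_classes, 1)   # decoder classification head

    def forward(self, optical, sar):
        ol, sl = self.opt_low(optical), self.sar_low(sar)
        oh, sh = self.opt_high(ol), self.sar_high(sl)
        low = self.fuse_low(ol, sl)                    # low-level cross-modal fusion
        high = self.aspp(self.fuse_high(oh, sh))       # high-level fusion + ASPP
        high = F.interpolate(high, size=low.shape[-2:],
                             mode="bilinear", align_corners=False)
        out = self.head(torch.cat([low, high], 1))     # decoder: merge and classify
        return F.interpolate(out, scale_factor=2,      # restore input resolution
                             mode="bilinear", align_corners=False)
```

The attention weights in the iAFF sketch let the network decide, per channel, how much to trust the optical stream versus the SAR edge stream, which is one plausible way to reconcile the distinct feature distributions of the two modalities noted above.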
Keywords