MFCANet: Multiscale Feature Context Aggregation Network for Oriented Object Detection in Remote-Sensing Images

Honghui Jiang; Tingting Luo; Hu Peng; Guozheng Zhang

doi:10.1109/ACCESS.2024.3381539

IEEE Access (Jan 2024)

MFCANet: Multiscale Feature Context Aggregation Network for Oriented Object Detection in Remote-Sensing Images

Honghui Jiang,
Tingting Luo,
Hu Peng,
Guozheng Zhang

Affiliations

Honghui Jiang: School of Internet and Communication, Anhui Technical College of Mechanical and Electrical Engineering, Wuhu, China
Tingting Luo: State Gride Wuhu Power Supply Company, Wuhu, China
Hu Peng: ORCiD; School of Instrument Science and Opto-Electronics Engineering, Hefei University of Technology, Hefei, China
Guozheng Zhang: ORCiD; School of Mechanical Engineering, Anhui Technical College of Mechanical and Electrical Engineering, Wuhu, China

DOI: https://doi.org/10.1109/ACCESS.2024.3381539
Journal volume & issue: Vol. 12
pp. 45986 – 46001

Abstract

Read online

Rotated object detection in remote sensing images presents a highly challenging task due to the extensive fields of view and complex backgrounds. While Convolutional Neural Networks (CNNs) and Transformer networks have made progress in this area, there is still a lack of research on extracting and fusing features for small targets in complex backgrounds. To address this gap, we have extended the RTMDet framework by introducing three modules: the Focused Feature Context Aggregation Module, the Feature Context Information Enhancement Module, and the Multi-scale Feature Fusion Module. In the Focused Feature Context Aggregation Module, we replaced the Spatial Pyramid Pooling Bottleneck (SPPFBottleneck) to better extract small target features by focusing on contextual information. The Feature Context Information Enhancement Module enhances the model’s perception of multi-dimensional temporal and spatial information. Finally, we combined the original features with the fused ones to prevent the loss of specific features during the fusion process. Our proposed model, named the Multi-scale Feature Context Aggregation Network (MFCANet), was evaluated on four challenging remote sensing datasets (MAR20, SRSDD, HRSC, and DIOR-R). The experimental results demonstrate that our method outperforms baseline models, achieving improvements of 2.13%, 10.28%, 1.46%, and 1.13% in mAP for the MAR20, SRSDD, HRSC, and DIOR-R datasets, respectively.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords