Remote Sensing (Apr 2023)

A Multi-Feature Fusion and Attention Network for Multi-Scale Object Detection in Remote Sensing Images

  • Yong Cheng,
  • Wei Wang,
  • Wenjie Zhang,
  • Ling Yang,
  • Jun Wang,
  • Huan Ni,
  • Tingzhao Guan,
  • Jiaxin He,
  • Yakang Gu,
  • Ngoc Nguyen Tran

DOI
https://doi.org/10.3390/rs15082096
Journal volume & issue
Vol. 15, no. 8
p. 2096

Abstract

Read online

Accurate multi-scale object detection in remote sensing images poses a challenge due to the complexity of transferring deep features to shallow features among multi-scale objects. Therefore, this study developed a multi-feature fusion and attention network (MFANet) based on YOLOX. By reparameterizing the backbone, fusing multi-branch convolution and attention mechanisms, and optimizing the loss function, the MFANet strengthened the feature extraction of objects at different sizes and increased the detection accuracy. The ablation experiment was carried out on the NWPU VHR-10 dataset. Our results showed that the overall performance of the improved network was around 2.94% higher than the average performance of every single module. Based on the comparison experiments, the improved MFANet demonstrated a high mean average precision of 98.78% for 9 classes of objects in the NWPU VHR-10 10-class detection dataset and 94.91% for 11 classes in the DIOR 20-class detection dataset. Overall, MFANet achieved an mAP of 96.63% and 87.88% acting on the NWPU VHR-10 and DIOR datasets, respectively. This method can promote the development of multi-scale object detection in remote sensing images and has the potential to serve and expand intelligent system research in related fields such as object tracking, semantic segmentation, and scene understanding.

Keywords