IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (Jan 2023)

Decoupled Feature Pyramid Learning for Multi-Scale Object Detection in Low-Altitude Remote Sensing Images

  • Haokai Sun,
  • Yaxiong Chen,
  • Xiongbo Lu,
  • Shengwu Xiong

DOI
https://doi.org/10.1109/JSTARS.2023.3293395
Journal volume & issue
Vol. 16
pp. 6556 – 6567

Abstract

Low-altitude remote sensing platforms are now widely used in practical applications, and object detection is a fundamental technology supporting them. Scale imbalance is a predominant problem in low-altitude remote sensing images and makes detecting objects in such imagery highly challenging. In this article, we therefore improve detection performance by mitigating scale imbalance. First, we choose a one-stage object detector with decoupled heads as the baseline because of its comparatively high efficiency and accuracy. Current decoupled heads ignore the interlayer relationship and the information contained therein. Moreover, all existing feature pyramid structures generate a single feature map shared by the two branches at every layer. Motivated by these observations, we propose a novel feature pyramid network paradigm, the decoupled feature pyramid network, which accounts for the different preferences of classification and localization. Meanwhile, introducing a feature pyramid architecture degrades performance on larger objects because the upper layers receive insufficient supervision during training. We therefore adopt a distinct supervision strategy, level supervision, which pays more attention to the upper layers. We conduct extensive experiments on two popular benchmarks for object detection in low-altitude remote sensing images to validate the effectiveness of the proposed method. In addition, we introduce a scale imbalance metric that quantifies the degree of object size variation, to better illustrate the method's ability to relieve the scale imbalance problem. Our proposed approach achieves state-of-the-art performance on both datasets.
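
The abstract does not state how level supervision distributes attention across pyramid levels. As a purely illustrative sketch, and not the authors' formulation, one way to "pay more attention to upper layers" is to combine per-level losses with weights that grow toward the coarser levels. The function names `level_weights` and `level_supervised_loss`, the `growth` parameter, and the geometric weighting scheme are all assumptions made for this example.

```python
def level_weights(num_levels, growth=1.5):
    """Return normalized weights that increase from lower to upper pyramid levels.

    Assumption: a geometric schedule (growth > 1); the paper may use a
    different rule entirely.
    """
    raw = [growth ** i for i in range(num_levels)]
    total = sum(raw)
    return [w / total for w in raw]


def level_supervised_loss(per_level_losses, growth=1.5):
    """Combine per-level losses, ordered from lowest (finest) to highest (coarsest).

    Upper levels receive larger weights, so their errors contribute more
    to the total than they would under a plain average.
    """
    weights = level_weights(len(per_level_losses), growth)
    return sum(w * loss for w, loss in zip(weights, per_level_losses))


if __name__ == "__main__":
    # Hypothetical per-level loss values, finest level first.
    losses = [0.8, 0.6, 0.9]
    print(level_weights(len(losses)))
    print(level_supervised_loss(losses))
```

With `growth=1.0` the weights become uniform and the result reduces to a plain average of the per-level losses, which makes the effect of the extra emphasis on upper levels easy to compare.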

Keywords