IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (Jan 2024)

Multimodal Object Detection of UAV Remote Sensing Based on Joint Representation Optimization and Specific Information Enhancement

  • Jinpeng Wang,
  • Congan Xu,
  • Chunhui Zhao,
  • Long Gao,
  • Junfeng Wu,
  • Yiming Yan,
  • Shou Feng,
  • Nan Su

DOI
https://doi.org/10.1109/JSTARS.2024.3373816
Journal volume & issue
Vol. 17
pp. 12364 – 12373

Abstract

Read online

With the development of Earth observation technology, it becomes easier and easier to acquire multimodal image data at the same time. To improve the performance of a multimodal remote-sensing detection algorithm, a new fusion feature optimization detection network is proposed. The method is designed to solve the problem of performance degradation caused by the unreliability of single-modal data in multimodal remote-sensing data. The key to obtain high-quality fusion features from multimodal data with interference is to suppress single-modal redundant features and fully integrate multimodal features. The proposed method mainly includes two improvements. First, a novel joint expression optimization module is designed to enhance the target features and suppress the redundant and interference features that affect the fusion effect. In addition, we propose a novel specific information enhancement module to further enhance the discriminative feature information of targets within each modal image. Experiments on the DroneVehicle dataset show that our proposed method is state of the art on this dataset.

Keywords