Target Detection Model Distillation Using Feature Transition and Label Registration for Remote Sensing Imagery

Boya Zhao; Qing Wang; Yuanfeng Wu; Qingqing Cao; Qiong Ran

doi:10.1109/JSTARS.2022.3188252

IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (Jan 2022)

Target Detection Model Distillation Using Feature Transition and Label Registration for Remote Sensing Imagery

Boya Zhao,
Qing Wang,
Yuanfeng Wu,
Qingqing Cao,
Qiong Ran

Affiliations

Boya Zhao: ORCiD; Key Laboratory of Computational Optical Imaging Technology, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, China
Qing Wang: College of Information Science and Technology, Beijing University of Chemical Technology, Beijing, China
Yuanfeng Wu: ORCiD; Key Laboratory of Computational Optical Imaging Technology, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, China
Qingqing Cao: ORCiD; Key Laboratory of Computational Optical Imaging Technology, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, China
Qiong Ran: College of Information Science and Technology, Beijing University of Chemical Technology, Beijing, China

DOI: https://doi.org/10.1109/JSTARS.2022.3188252
Journal volume & issue: Vol. 15
pp. 5416 – 5426

Abstract

Read online

Deep convolution networks have been widely used in remote sensing target detection for various applications in recent years. Target detection models with many parameters provide better results but are not suitable for resource-constrained devices due to their high computational cost and storage requirements. Furthermore, current lightweight target detection models for remote sensing imagery rarely have the advantages of existing models. Knowledge distillation can improve the learning ability of a small student network from a large teacher network due to acceleration and compression. However, current knowledge distillation methods typically use mature backbones as teacher and student networks are unsuitable for target detection in remote sensing imagery. In this article, we propose a target detection model distillation (TDMD) framework using feature transition and label registration for remote sensing imagery. A lightweight attention network is designed by ranking the importance of the convolutional feature layers in the teacher network. Multiscale feature transition based on a feature pyramid is utilized to constrain the feature maps of the student network. A label registration procedure is proposed to improve the TDMD model's learning ability of the output distribution of the teacher network. The proposed method is evaluated on the DOTA and NWPU VHR-10 remote sensing image datasets. The results show that the TDMD achieves a mean Average Precision (mAP) of 75.47% and 93.81% on the DOTA and NWPU VHR-10 datasets, respectively. Moreover, the model size is 43% smaller than that of the predecessor model (11.8 MB and 11.6 MB for the two datasets).

Published in IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing

ISSN: 1939-1404 (Print); 2151-1535 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Ocean engineering; Science: Physics: Geophysics. Cosmic physics
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=4609443

About the journal

Abstract

Keywords