IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (Jan 2024)
Detection of SAR Image Multiscale Ship Targets in Complex Inshore Scenes Based on Improved YOLOv5
Abstract
Synthetic aperture radar (SAR) can operate around the clock and in all weather, and therefore high-resolution SAR images have been frequently applied for ship inspection. However, current ship target detection and identification methods have limited detection accuracy and lead to missing detection of small targets due to speckle noise caused by the imaging principle of SAR imagery and complex nearshore interference. Therefore, this article proposes an improved YOLOv5 method to address the problem of low accuracy in multiship target detection tasks in complex scenes. The developed scheme enhances the ship target detection performance while reducing the number of parameters. Specifically, first, we improve the size of the input SAR images and optimize the anchor frames of ship targets in the training dataset to locate small target ships more accurately. Then, asymmetric pyramidal nonlocal block and sim attention mechanism are introduced to reduce nearshore background interference. Additionally, to make the C3 module output richer and with more feature information, channel shuffling is performed after the C3 output to enhance the information exchange between channels. Finally, to reduce the number of parameters and computational cost during model training, the normal convolution in the neck part is replaced with Ghost convolution. The index F1 of the proposed method on the high-resolution SAR image dataset and SAR ship detection dataset reaches the highest of 91.3% and 95.8%, respectively. MAPs (0.5:0.95) for the two datasets are both the highest, which are at least 2% higher than the suboptimal method. In selected specific inshore scenes, the ship detection performance of the proposed method outperforms current advanced methods for multiscale ships. It is shown that the proposed method can extract ship features effectively in complex scenes and its effectiveness is further validated on the large-scene AIR–SARShip-1 dataset.
Keywords