A lightweight CNN for multi-source infrared ship detection from unmanned marine vehicles
Liqian Wang,
Yakui Dong,
Cheng Fei,
Junliang Liu,
Shuzhen Fan,
Yunxia Liu,
Yongfu Li,
Zhaojun Liu,
Xian Zhao
Affiliations
Liqian Wang
Key Laboratory of Laser & Infrared System, Shandong University, Ministry of Education, 72 Binhai Road, Qingdao, 266237, Shandong, China; Center for Optics Research and Engineering, 72 Binhai Road, Qingdao, 266237, Shandong, China
Yakui Dong
Key Laboratory of Laser & Infrared System, Shandong University, Ministry of Education, 72 Binhai Road, Qingdao, 266237, Shandong, China; Center for Optics Research and Engineering, 72 Binhai Road, Qingdao, 266237, Shandong, China
Cheng Fei
Key Laboratory of Laser & Infrared System, Shandong University, Ministry of Education, 72 Binhai Road, Qingdao, 266237, Shandong, China; Center for Optics Research and Engineering, 72 Binhai Road, Qingdao, 266237, Shandong, China
Junliang Liu
Key Laboratory of Laser & Infrared System, Shandong University, Ministry of Education, 72 Binhai Road, Qingdao, 266237, Shandong, China; Center for Optics Research and Engineering, 72 Binhai Road, Qingdao, 266237, Shandong, China
Shuzhen Fan
Key Laboratory of Laser & Infrared System, Shandong University, Ministry of Education, 72 Binhai Road, Qingdao, 266237, Shandong, China; Center for Optics Research and Engineering, 72 Binhai Road, Qingdao, 266237, Shandong, China
Yunxia Liu
Key Laboratory of Laser & Infrared System, Shandong University, Ministry of Education, 72 Binhai Road, Qingdao, 266237, Shandong, China; Center for Optics Research and Engineering, 72 Binhai Road, Qingdao, 266237, Shandong, China
Yongfu Li
Key Laboratory of Laser & Infrared System, Shandong University, Ministry of Education, 72 Binhai Road, Qingdao, 266237, Shandong, China; Center for Optics Research and Engineering, 72 Binhai Road, Qingdao, 266237, Shandong, China; Corresponding author at: Center for Optics Research and Engineering, 72 Binhai Road, Qingdao, 266237, Shandong, China.
Zhaojun Liu
Key Laboratory of Laser & Infrared System, Shandong University, Ministry of Education, 72 Binhai Road, Qingdao, 266237, Shandong, China; School of Information Science and Engineering, 72 Binhai Road, Qingdao, 266237, Shandong, China
Xian Zhao
Key Laboratory of Laser & Infrared System, Shandong University, Ministry of Education, 72 Binhai Road, Qingdao, 266237, Shandong, China; Center for Optics Research and Engineering, 72 Binhai Road, Qingdao, 266237, Shandong, China
Infrared ship detection is of great significance due to its broad applicability in maritime surveillance, traffic safety and security. Multiple infrared sensors with different spectral sensitivity provide enhanced sensing capabilities, facilitating ship detection in complex environments. Nevertheless, current researches lack discussion and exploration of infrared imagers in different spectral ranges for marine objects detection. Furthermore, for unmanned marine vehicles (UMVs), e.g., unmanned surface vehicles (USVs) and unmanned ship (USs), detection and perception are usually performed in embedded devices with limited memory and computation resource, which makes traditional convolutional neural network (CNN)-based detection methods struggle to leverage their advantages. Aimed at the task of sea surface object detection on USVs, this paper provides lightweight CNNs with high inference speed that can be deployed on embedded devices. It also discusses the advantages and disadvantages of using different sensors in marine object detection, providing a reference for the perception and decision-making modules of USVs. The proposed method can detect ships in short-wave infrared (SWIR), long-wave infrared (LWIR) and fused images with high-performance and high-inference speed on an embedded device. Specifically, the backbone is built from bottleneck depth-separable convolution with residuals. Generating redundant feature maps by using cheap linear operation in neck and head networks. The learning and representation capacities of the network are promoted by introducing the channel and spatial attention, redesigning the sizes of anchor boxes. Comparative experiments are conducted on the infrared ship dataset that we have released which contains SWIR, LWIR and the fused images. The results indicate that the proposed method can achieve high accuracy but with fewer parameters, and the inference speed is nearly 60 frames per second (FPS) on an embedded device.