IEEE Access (Jan 2020)

Far-Infrared Object Segmentation Focus on Transmission of Overall Semantic Information

  • Ying Zang,
  • Bo Yu,
  • Longjiao Yu,
  • Dongsheng Yang,
  • Qingshan Liu

DOI
https://doi.org/10.1109/ACCESS.2020.3028656
Journal volume & issue
Vol. 8
pp. 182564 – 182579

Abstract

Read online

In the vision task of a self-driving system, the use of visible light images to segment an object often loses its functionality at night or in harsh weather. The far-infrared image shows different pixel values according to the thermal radiation quantity of the object itself, so it can be adapted to perform well at night and in harsh weather conditions. However, at the same time, it has insufficient texture features, blurred object boundaries and temperature inversion, which has a great impact on the segmentation task of traditional algorithms. In response to the above problems, this article proposes a far-infrared object segmentation algorithm using deep learning. In the current popular encoding-decoding structure, multi-scale pooling layers are used to obtain receptive fields of different sizes. This is used to solve the effects caused by the blurring of infrared objects. The feature enhancement module is designed for the multi-receptive field feature map, which can filter out the most versatile and highly semantic feature channels to reduce the effect of temperature inversion on segmentation. The obtained high semantic feature map is guided into the decoding structure and is fused with the features obtained by the encoder and the decoder. This allows richer information to be obtained between different feature maps. Finally, we also release a new low-resolution far-infrared segmentation dataset. Experiments are performed on three datasets, and the segmentation result of the mIoU(mean Intersection over Union) reaches 70.59%, 30.98% and 60.67%. A large number of experiments confirm the effectiveness and robustness of the network in far-infrared images and verify that the dataset released in this article has strong reference significance.

Keywords