A New Deep Model for Detecting Multiple Moving Targets in Real Traffic Scenarios: Machine Vision-Based Vehicles

Xiaowei Xu; Hao Xiong; Liu Zhan; Grzegorz Królczyk; Rafal Stanislawski; Paolo Gardoni; Zhixiong Li

doi:10.3390/s22103742

Sensors (May 2022)

A New Deep Model for Detecting Multiple Moving Targets in Real Traffic Scenarios: Machine Vision-Based Vehicles

Xiaowei Xu,
Hao Xiong,
Liu Zhan,
Grzegorz Królczyk,
Rafal Stanislawski,
Paolo Gardoni,
Zhixiong Li

Affiliations

Xiaowei Xu: School of Automobile and Traffic Engineering, Wuhan University of Science and Technology, Wuhan 430081, China
Hao Xiong: School of Automobile and Traffic Engineering, Wuhan University of Science and Technology, Wuhan 430081, China
Liu Zhan: School of Automobile and Traffic Engineering, Wuhan University of Science and Technology, Wuhan 430081, China
Grzegorz Królczyk: Department of Manufacturing Engineering and Automation Products, Opole University of Technology, 45758 Opole, Poland
Rafal Stanislawski: Department of Electrical, Control and Computer Engineering, Opole University of Technology, 45758 Opole, Poland
Paolo Gardoni: Department of Civil and Environmental Engineering, University of Illinois at Urbana-Champaign, Champaign, IL 61820, USA
Zhixiong Li: Department of Manufacturing Engineering and Automation Products, Opole University of Technology, 45758 Opole, Poland

DOI: https://doi.org/10.3390/s22103742
Journal volume & issue: Vol. 22, no. 10
p. 3742

Abstract

Read online

When performing multiple target detection, it is difficult to detect small and occluded targets in complex traffic scenes. To this end, an improved YOLOv4 detection method is proposed in this work. Firstly, the network structure of the original YOLOv4 is adjusted, and the 4× down-sampling feature map of the backbone network is introduced into the neck network of the YOLOv4 model to splice the feature map with 8× down-sampling to form a four-scale detection structure, which enhances the fusion of deep and shallow semantics information of the feature map to improve the detection accuracy of small targets. Then, the convolutional block attention module (CBAM) is added to the model neck network to enhance the learning ability for features in space and on channels. Lastly, the detection rate of the occluded target is improved by using the soft non-maximum suppression (Soft-NMS) algorithm based on the distance intersection over union (DIoU) to avoid deleting the bounding boxes. On the KITTI dataset, experimental evaluation is performed and the analysis results demonstrate that the proposed detection model can effectively improve the multiple target detection accuracy, and the mean average accuracy (mAP) of the improved YOLOv4 model reaches 81.23%, which is 3.18% higher than the original YOLOv4; and the computation speed of the proposed model reaches 47.32 FPS. Compared with existing popular detection models, the proposed model produces higher detection accuracy and computation speed.

Published in Sensors

ISSN: 1424-8220 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Chemical technology
Website: http://www.mdpi.com/journal/sensors

About the journal

Abstract

Keywords