IET Image Processing (Oct 2024)

A study on a target detection model for autonomous driving tasks

  • Hao Chen,
  • Byung‐Won Min,
  • Haifei Zhang

DOI
https://doi.org/10.1049/ipr2.13185
Journal volume & issue
Vol. 18, no. 12
pp. 3447 – 3459

Abstract

Read online

Abstract Target detection in autonomous driving tasks presents a complex and critical challenge due to the diversity of targets and the intricacy of the environment. To address this issue, this paper proposes an enhanced YOLOv8 model. Firstly, the original large target detection head is removed and replaced with a detection head tailored for small targets and high‐level semantic details. Secondly, an adaptive feature fusion method is proposed, where input feature maps are processed using dilated convolutions with different dilation rates, followed by adaptive feature fusion to generate adaptive weights. Finally, an improved attention mechanism is incorporated to enhance the model's focus on target regions. Additionally, the impact of Group Shuffle Convolution (GSConv) on the model's detection speed is investigated. Validated on two public datasets, the model achieves a mean Average Precision (mAP) of 53.7% and 53.5%. Although introducing GSConv results in a slight decrease in mAP, it significantly improves frames per second. These findings underscore the effectiveness of the proposed model in autonomous driving tasks.

Keywords