Applied Sciences (Oct 2024)

YOLO-ESL: An Enhanced Pedestrian Recognition Network Based on YOLO

  • Feilong Wang,
  • Xiaobing Yang,
  • Juan Wei

DOI
https://doi.org/10.3390/app14209588
Journal volume & issue
Vol. 14, no. 20
p. 9588

Abstract

Read online

Pedestrian detection is a critical task in computer vision; however, mainstream algorithms often struggle to achieve high detection accuracy in complex scenarios, particularly due to target occlusion and the presence of small objects. This paper introduces a novel pedestrian detection algorithm, YOLO-ESL, based on the YOLOv7 framework. YOLO-ESL integrates the ELAN-SA module, designed to enhance feature extraction, with the LGA module, which improves feature fusion. The ELAN-SA module optimizes the flexibility and efficiency of small object feature extraction, while the LGA module effectively integrates multi-scale features through local and global attention mechanisms. Additionally, the CIOUNMS algorithm addresses the issue of target loss in cases of high overlap, improving boundary box filtering. Evaluated on the VOC2012 pedestrian dataset, YOLO-ESL achieved an accuracy of 93.7%, surpassing the baseline model by 3.0%. Compared to existing methods, this model not only demonstrates strong performance in handling occluded and small object detection but also remarkable robustness and efficiency.

Keywords