DCPDN: High-Performance Pedestrian Detection Networks for Difficult Conditions

Yi Chen; Xinqing Wang; Yan Ouyang

doi:10.1109/ACCESS.2023.3293120

IEEE Access (Jan 2023)

DCPDN: High-Performance Pedestrian Detection Networks for Difficult Conditions

Yi Chen,
Xinqing Wang,
Yan Ouyang

Affiliations

Yi Chen: ORCiD; Department of Mechanical Engineering, College of Field Engineering, Army Engineering University, Nanjing, China
Xinqing Wang: Department of Mechanical Engineering, College of Field Engineering, Army Engineering University, Nanjing, China
Yan Ouyang: Department of Mechanical Engineering, College of Field Engineering, Army Engineering University, Nanjing, China

DOI: https://doi.org/10.1109/ACCESS.2023.3293120
Journal volume & issue: Vol. 11
pp. 71371 – 71386

Abstract

Read online

Pedestrian detection is the use of computer vision technology to identify and accurately locate pedestrians in image or video data, which has a strong use value. This technology can be used as the research basis for visual tasks such as person re-identification, human pose estimation and behavior analysis, and can also be applied to industrial fields such as intelligent security, automatic driving and human-computer interaction. However, the problems of low image resolution, blurred appearance, large scale difference of pedestrians, occluded pedestrians and complex background still bring great challenges to the detection performance. To solve these problems, this paper proposes a high-performance pedestrian detection network dedicated to difficult conditions: DCPDN. Firstly, we design an optimized super-resolution reconstruction network to preprocess the image to alleviate the performance damage caused by low-resolution and blurred images. Then, to solve the multi-scale problem in pedestrian detection, we propose a weighted cross-scale feature fusion module, which adopts a hierarchical detection strategy to deal with pedestrian objects of different scales while fully fusing feature maps of different levels. Finally, to solve the occlusion problem that has plagued pedestrian detection for a long time, we design an occlusion processing module based on graph convolutional network, which can effectively use the correlation information between different parts of the human body and promote the feature expression of occluded objects. On the CityPersons dataset, the ${MR}^{-2}$ of the detector is reduced by 6.9%, 19.2%, 8.9%, 1.9%, 3.6% and 14.2%, respectively, corresponding to different partition subsets of R, HO, A, L, M and S. On the Caltech dataset, corresponding to different divisions of R, HO, A, L and S, the ${MR}^{-2}$ of the detector is reduced by 9.9%, 15.8%, 16.3%, 6.8% and 25.8%, respectively. The experimental results show that the performance improvement of the detector is significant on both severe occlusion (HO) and small scale (S) subsets. After testing, the algorithm has strong robustness to occluded pedestrians, and can be easily embedded in other detection frameworks. Our DCPDN is able to compete with the state of the art methods and is especially effective when dealing with the pedestrian detection problem under difficult conditions.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords