IEEE Access (Jan 2020)

Robust Head Detection in Complex Videos Using Two-Stage Deep Convolution Framework

  • Sultan Daud Khan,
  • Yasir Ali,
  • Basim Zafar,
  • Abdulfattah Noorwali

DOI
https://doi.org/10.1109/ACCESS.2020.2995764
Journal volume & issue
Vol. 8
pp. 98679 – 98692

Abstract

Pedestrian head detection plays an important role in identifying and localizing individuals in real-world visual data. Head detection is a nontrivial problem due to considerable variance in camera viewpoints, scales, human poses, and appearances in the scene. The translation invariance property of convolutional neural networks (CNNs) enables large-capacity CNNs to handle appearance and pose variations in the scene. However, scale invariance is still an open issue. To address this problem, this paper presents a two-stage head detection framework that uses a fully convolutional network (FCN) to generate scale-aware proposals, followed by a CNN that classifies each proposal into one of two classes, i.e., head or background. Experimental results show that using the scale-aware proposals obtained by the FCN improves the object recall rate and mean average precision (mAP). Additionally, we demonstrate that our framework achieves state-of-the-art results on four challenging benchmark datasets, i.e., HollywoodHeads, Casablanca, SHOCK, and WIDERFACE.
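
The two-stage flow described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the FCN outputs, so the per-cell `score_map` and `scale_map`, the `stride`, both thresholds, and the `classify` callable are all hypothetical stand-ins chosen for illustration.

```python
from typing import Callable, List, Sequence, Tuple

Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max)


def scale_aware_proposals(
    score_map: Sequence[Sequence[float]],
    scale_map: Sequence[Sequence[float]],
    stride: int = 8,
    score_thresh: float = 0.5,
) -> List[Box]:
    """Stage 1 stand-in: convert dense per-cell outputs into square proposals.

    Assumption: each cell carries a head-likeness score and a predicted head
    size in pixels, so the box size follows the local scale estimate.
    """
    boxes: List[Box] = []
    for row, (scores, sizes) in enumerate(zip(score_map, scale_map)):
        for col, (score, size) in enumerate(zip(scores, sizes)):
            if score < score_thresh:
                continue  # cell is unlikely to contain a head
            cx = (col + 0.5) * stride  # cell centre in image coordinates
            cy = (row + 0.5) * stride
            half = size / 2.0
            boxes.append((cx - half, cy - half, cx + half, cy + half))
    return boxes


def detect_heads(
    score_map: Sequence[Sequence[float]],
    scale_map: Sequence[Sequence[float]],
    classify: Callable[[Box], float],
    stride: int = 8,
    score_thresh: float = 0.5,
    head_thresh: float = 0.5,
) -> List[Tuple[Box, float]]:
    """Stage 2 stand-in: keep proposals the classifier labels as head."""
    detections: List[Tuple[Box, float]] = []
    for box in scale_aware_proposals(score_map, scale_map, stride, score_thresh):
        p_head = classify(box)  # hypothetical CNN: probability of "head"
        if p_head >= head_thresh:
            detections.append((box, p_head))
    return detections


# Toy inputs: a 2x2 grid of scores and predicted head sizes (pixels).
toy_scores = [[0.9, 0.1], [0.2, 0.8]]
toy_scales = [[16.0, 16.0], [16.0, 32.0]]


def toy_classifier(box: Box) -> float:
    """Placeholder for the second-stage CNN; favours small boxes only."""
    return 0.9 if (box[2] - box[0]) <= 20 else 0.1


toy_detections = detect_heads(toy_scores, toy_scales, toy_classifier)
```

In this toy run, two cells pass the first-stage score threshold, and the placeholder classifier keeps only the smaller of the two proposals. A real system would replace both stand-ins with trained networks and would typically add non-maximum suppression, which the sketch omits.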

Keywords