Human Pose Estimation from Monocular Images: A Comprehensive Survey

Wenjuan Gong; Xuena Zhang; Jordi Gonzàlez; Andrews Sobral; Thierry Bouwmans; Changhe Tu; El-hadi Zahzah

doi:10.3390/s16121966

Sensors (Nov 2016)

Affiliations

Wenjuan Gong: Department of Computer Science and Technology, China University of Petroleum, Qingdao 266580, China
Xuena Zhang: Department of Computer Science and Technology, China University of Petroleum, Qingdao 266580, China
Jordi Gonzàlez: Computer Vision Center, University Autònoma de Barcelona, 08193 Catalonia, Spain
Andrews Sobral: Laboratory MIA, University of La Rochelle, 17042 La Rochelle CEDEX, France
Thierry Bouwmans: Laboratory MIA, University of La Rochelle, 17042 La Rochelle CEDEX, France
Changhe Tu: School of Computer Science and Technology, Shandong University, Jinan 250100, China
El-hadi Zahzah: Laboratory L3i, University of La Rochelle, 17042 La Rochelle CEDEX, France

DOI: https://doi.org/10.3390/s16121966
Journal volume & issue: Vol. 16, no. 12, p. 1966

Abstract

Human pose estimation is the task of locating body parts in an image and determining how they are connected. Human pose estimation from monocular images has wide applications (e.g., image indexing). Several surveys on human pose estimation can be found in the literature, but each focuses on a particular category, such as model-based approaches or human motion analysis. As far as we know, an overall review of this problem domain has yet to be provided. Furthermore, recent advances in deep learning have brought novel algorithms for this problem. This paper presents a comprehensive survey of human pose estimation from monocular images, covering both milestone works and recent advances. Following a standard pipeline for solving computer vision problems, the survey splits the problem into several modules: feature extraction and description, human body models, and modeling methods. Modeling methods are categorized in two ways: as top-down versus bottom-up methods, and as generative versus discriminative methods. Because one direct application of human pose estimation is to provide initialization for automatic video surveillance, each module includes an additional section on motion-related methods: motion features, motion models, and motion-based methods. Finally, the paper collects 26 publicly available data sets for validation and describes frequently used error measurement methods.

Published in Sensors

ISSN: 1424-8220 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Chemical technology
Website: http://www.mdpi.com/journal/sensors
