Training‐based head pose estimation under monocular vision

Zhizhi Guo; Qianxiang Zhou; Zhongqi Liu; Chunhui Liu

doi:10.1049/iet-cvi.2015.0457

IET Computer Vision (Dec 2016)

Affiliations

Zhizhi Guo: School of Biological Science and Medical Engineering, Beihang University, Beijing 100191, People's Republic of China
Qianxiang Zhou: School of Biological Science and Medical Engineering, Beihang University, Beijing 100191, People's Republic of China
Zhongqi Liu: School of Biological Science and Medical Engineering, Beihang University, Beijing 100191, People's Republic of China
Chunhui Liu: School of Biological Science and Medical Engineering, Beihang University, Beijing 100191, People's Republic of China

DOI: https://doi.org/10.1049/iet-cvi.2015.0457
Journal volume & issue: Vol. 10, no. 8
pp. 798 – 805

Abstract

Although many 3D head pose estimation methods based on monocular vision can achieve an accuracy of 5°, reducing the number of required training samples and avoiding the use of hardware parameters as input features remain among the biggest challenges in the field of head pose estimation. To address these challenges, the authors propose an accurate head pose estimation method that can act as an extension to facial key point detection systems. The basic idea is to use the normalised distances between key points as input features, and to use ℓ1‐minimisation to select a sparse set of training samples that reflects the mapping between the feature vector space and the head pose space. The linear combination of the head poses corresponding to these samples represents the head pose of the test sample. The experimental results show that the authors’ method can achieve an accuracy of 2.6° without any extra hardware parameters or information about the subject. In addition, under large head movement and varying illumination, the authors’ method is still able to estimate the head pose.
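
The pipeline outlined in the abstract (normalised key point distances as features, ℓ1‐minimisation to pick a sparse set of training samples, then a linear combination of their head poses) can be illustrated with a minimal sketch. It is not the authors' implementation, and several details are assumptions not stated in the abstract: distances are normalised by the largest pairwise distance, the ℓ1 problem is posed in its penalised (lasso) form and solved with plain iterative soft-thresholding (ISTA), and the resulting weights are applied to the training poses without rescaling. The function names and the parameters `lam` and `n_iter` are hypothetical.

```python
import numpy as np


def normalised_distances(keypoints):
    """Pairwise distances between 2D key points, divided by the largest one.

    Assumption: the abstract does not say how distances are normalised.
    """
    pts = np.asarray(keypoints, dtype=float)
    i, j = np.triu_indices(len(pts), k=1)  # every unordered pair once
    d = np.linalg.norm(pts[i] - pts[j], axis=1)
    return d / d.max()


def sparse_weights(A, y, lam=0.01, n_iter=500):
    """ISTA for min_w 0.5 * ||A w - y||^2 + lam * ||w||_1.

    Columns of A are training feature vectors; y is the test feature vector.
    """
    A = np.asarray(A, dtype=float)
    y = np.asarray(y, dtype=float)
    step = 1.0 / np.linalg.norm(A, 2) ** 2  # 1 / Lipschitz constant of the gradient
    w = np.zeros(A.shape[1])
    for _ in range(n_iter):
        z = w - step * (A.T @ (A @ w - y))  # gradient step on the squared error
        w = np.sign(z) * np.maximum(np.abs(z) - lam * step, 0.0)  # soft threshold
    return w


def estimate_pose(train_features, train_poses, test_feature, lam=0.01):
    """Estimate a head pose as a sparse linear combination of training poses.

    train_features: (n_samples, n_features); train_poses: (n_samples, 3),
    for example yaw, pitch and roll in degrees.
    """
    A = np.asarray(train_features, dtype=float).T  # one column per training sample
    w = sparse_weights(A, test_feature, lam)
    return w @ np.asarray(train_poses, dtype=float)
```

A possible call, with hypothetical arrays, would be `estimate_pose(train_features, train_poses, normalised_distances(test_keypoints))`. The penalty `lam` trades sparsity against reconstruction error and shrinks the weights slightly, so in practice it would need tuning on held-out data.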

Published in IET Computer Vision

ISSN: 1751-9632 (Print); 1751-9640 (Online)
Publisher: Wiley
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Science: Mathematics: Instruments and machines: Electronic computers. Computer science: Computer software
Website: https://ietresearch.onlinelibrary.wiley.com/journal/17519640
