Mixing body‐parts model for 2D human pose estimation in stereo videos

Manuel I. López‐Quintero; Manuel J. Marín‐Jiménez; Rafael Muñoz‐Salinas; Rafael Medina‐Carnicer

doi:10.1049/iet-cvi.2016.0249

IET Computer Vision (Sep 2017)

Affiliations

Manuel I. López‐Quintero: Department of Computing and Numerical Analysis, University of Córdoba, Campus de Rabanales, 14071 Córdoba, Spain
Manuel J. Marín‐Jiménez: Department of Computing and Numerical Analysis, University of Córdoba, Campus de Rabanales, 14071 Córdoba, Spain
Rafael Muñoz‐Salinas: Department of Computing and Numerical Analysis, University of Córdoba, Campus de Rabanales, 14071 Córdoba, Spain
Rafael Medina‐Carnicer: Department of Computing and Numerical Analysis, University of Córdoba, Campus de Rabanales, 14071 Córdoba, Spain

DOI: https://doi.org/10.1049/iet-cvi.2016.0249
Journal volume & issue: Vol. 11, no. 6
pp. 426 – 433

Abstract

This study targets 2D articulated human pose estimation (i.e. localisation of body limbs) in stereo videos. Depth‐based devices (e.g. Microsoft Kinect) have gained popularity in recent years because they perform very well in controlled indoor environments (e.g. living rooms, operating theatres or gyms), but they suffer clear problems in outdoor scenarios, so human pose estimation remains an interesting unsolved problem. The authors propose a novel approach that localises upper‐body keypoints (i.e. shoulders, elbows and wrists) in temporal sequences of stereo image pairs. The method starts by locating and segmenting people in the image pairs using disparity and appearance information. Then, a set of candidate body poses is computed for each view independently. Finally, temporal and stereo consistency is applied to estimate a final 2D pose. The authors validate their model on three challenging datasets: ‘stereo human pose estimation dataset’, ‘poses in the wild’ and ‘INRIA 3DMovie’. The experimental results show that the model not only establishes new state‐of‐the‐art results on stereo sequences, but also brings improvements in monocular sequences.

Published in IET Computer Vision

ISSN: 1751-9632 (Print); 1751-9640 (Online)
Publisher: Wiley
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Science: Mathematics: Instruments and machines: Electronic computers. Computer science: Computer software
Website: https://ietresearch.onlinelibrary.wiley.com/journal/17519640
