Motion Capture Research: 3D Human Pose Recovery Based on RGB Video Sequences

Xin Min; Shouqian Sun; Honglie Wang; Xurui Zhang; Chao Li; Xianfu Zhang

doi:10.3390/app9173613

Applied Sciences (Sep 2019)

Motion Capture Research: 3D Human Pose Recovery Based on RGB Video Sequences

Xin Min,
Shouqian Sun,
Honglie Wang,
Xurui Zhang,
Chao Li,
Xianfu Zhang

Affiliations

Xin Min: College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China
Shouqian Sun: College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China
Honglie Wang: College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China
Xurui Zhang: College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China
Chao Li: College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China
Xianfu Zhang: College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China

DOI: https://doi.org/10.3390/app9173613
Journal volume & issue: Vol. 9, no. 17
p. 3613

Abstract

Read online

Using video sequences to restore 3D human poses is of great significance in the field of motion capture. This paper proposes a novel approach to estimate 3D human action via end-to-end learning of deep convolutional neural network to calculate the parameters of the parameterized skinned multi-person linear model. The method is divided into two main stages: (1) 3D human pose estimation based on a single frame image. We use 2D/3D skeleton point constraints, human height constraints, and generative adversarial network constraints to obtain a more accurate human-body model. The model is pre-trained using open-source human pose datasets; (2) Human-body pose generation based on video streams. Combined with the correlation of video sequences, a 3D human pose recovery method based on video streams is proposed, which uses the correlation between videos to generate a smoother 3D pose. In addition, we compared the proposed 3D human pose recovery method with the commercial motion capture platform to prove the effectiveness of the proposed method. To make a contrast, we first built a motion capture platform through two Kinect (V2) devices and iPi Soft series software to obtain depth-camera video sequences and monocular-camera video sequences respectively. Then we defined several different tasks, including the speed of the movements, the position of the subject, the orientation of the subject, and the complexity of the movements. Experimental results show that our low-cost method based on RGB video data can achieve similar results to commercial motion capture platform with RGB-D video data.

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci

About the journal

Abstract

Keywords