Joint Path Alignment Framework for 3D Human Pose and Shape Estimation From Video

Ji Woo Hong; Sunjae Yoon; Junyeong Kim; Chang D. Yoo

doi:10.1109/ACCESS.2023.3271285

IEEE Access (Jan 2023)

Joint Path Alignment Framework for 3D Human Pose and Shape Estimation From Video

Ji Woo Hong,
Sunjae Yoon,
Junyeong Kim,
Chang D. Yoo

Affiliations

Ji Woo Hong: ORCiD; School of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea
Sunjae Yoon: School of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea
Junyeong Kim: ORCiD; Department of Artificial Intelligence, Chung-Ang University, Seoul, Republic of Korea
Chang D. Yoo: ORCiD; School of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea

DOI: https://doi.org/10.1109/ACCESS.2023.3271285
Journal volume & issue: Vol. 11
pp. 43267 – 43275

Abstract

Read online

3D human pose and shape estimation (3D-HPSE) from video aims to generate sequence of 3D mesh that depict human body in the video. Current deep learning based 3D-HPSE networks that takes video input have focused on improving temporal consistency among sequence of 3D joints by supervising acceleration error between predicted and ground-truth human motion. However, these methods overlooked the geometric misalignments of persistent discrepancy between geometric paths drawn by sequence of predicted joints and that of ground-truth joints. To this end, we propose Joint Path Alignment (JPA) framework, a model-agnostic approach that mitigates geometric misalignments by introducing Temporal Procrustes Alignment Regularization (TPAR) loss that performs group-wise sequence learning of joint movement paths. Unlike previous methods that rely solely on per-frame supervision for accuracy, our framework adds sequence-level accuracy supervision with TPAR loss by performing Procrustes analysis on the geometric paths drawn by sequences of predicted joints. Our experiments show that JPA framework advances the network to exceed the previous state-of-the-art performances on benchmark datasets in both per-frame accuracy and video smoothness metric.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords