Vertex position estimation with spatial–temporal transformer for 3D human reconstruction

Xiangjun Zhang; Yinglin Zheng; Wenjin Deng; Qifeng Dai; Yuxin Lin; Wangzheng Shi; Ming Zeng

Graphical Models (Dec 2023)

Affiliations

Xiangjun Zhang: Xiamen University, Xiamen, China
Yinglin Zheng: Xiamen University, Xiamen, China
Wenjin Deng: Xiamen University, Xiamen, China
Qifeng Dai: Xiamen University, Xiamen, China
Yuxin Lin: Xiamen University, Xiamen, China
Wangzheng Shi: Xiamen University, Xiamen, China
Ming Zeng (corresponding author): Xiamen University, Xiamen, China

Journal volume & issue: Vol. 130, p. 101207

Abstract

Reconstructing 3D human pose and body shape from monocular images or videos is a fundamental task for understanding human dynamics. Frame-based methods fall into two broad categories: those that regress the parameters of a parametric body model (e.g., SMPL) and those that use alternative representations (e.g., volumetric shapes, 3D coordinates). Non-parametric representations have demonstrated superior performance owing to their greater flexibility. However, when applied to video data, non-parametric frame-based methods tend to produce temporally inconsistent, jittery results. To address this, we present a novel approach that directly regresses the 3D coordinates of the mesh vertices and body joints with a spatial–temporal Transformer. We introduce a SpatioTemporal Learning Block (STLB) consisting of a Spatial Learning Module (SLM) and a Temporal Learning Module (TLM), which leverages spatial and temporal information to model interactions at a finer granularity, specifically at the body token level. Our method outperforms previous state-of-the-art approaches on the Human3.6M and 3DPW benchmark datasets.
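
The split between spatial and temporal learning at the body token level can be illustrated with a small, dependency-free Python sketch. It is not the paper's implementation: the abstract does not give the internals of the STLB, so the function names (`spatial_step`, `temporal_step`, `spatio_temporal_block`), the spatial-then-temporal ordering, the residual additions, and the parameter-free attention (no learned projections, normalization, or feed-forward layers) are all assumptions made for illustration. The input `x[t][n]` is assumed to hold the feature vector of body token `n` in frame `t`.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(v - m) for v in xs]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(tokens):
    """Parameter-free scaled dot-product self-attention over a list of vectors.

    Assumption: queries, keys, and values are the tokens themselves.
    """
    d = len(tokens[0])
    out = []
    for q in tokens:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in tokens]
        weights = softmax(scores)
        out.append([sum(w * v[j] for w, v in zip(weights, tokens)) for j in range(d)])
    return out


def spatial_step(x):
    """Attend across the N body tokens inside each frame (frames stay independent)."""
    return [self_attention(frame) for frame in x]


def temporal_step(x):
    """Attend across the T frames separately for each body token index."""
    num_frames, num_tokens = len(x), len(x[0])
    out = [[None] * num_tokens for _ in range(num_frames)]
    for n in range(num_tokens):
        track = self_attention([x[t][n] for t in range(num_frames)])
        for t in range(num_frames):
            out[t][n] = track[t]
    return out


def add(x, y):
    """Element-wise sum of two [T][N][d] nested lists (residual connection)."""
    return [[[a + b for a, b in zip(u, v)] for u, v in zip(fx, fy)]
            for fx, fy in zip(x, y)]


def spatio_temporal_block(x):
    """Hypothetical block: spatial attention, then temporal attention (assumed order)."""
    x = add(x, spatial_step(x))
    return add(x, temporal_step(x))


# Toy input: T=3 frames, N=2 body tokens, d=2 features per token.
frames = [
    [[1.0, 0.0], [0.0, 1.0]],
    [[0.5, 0.5], [1.0, 1.0]],
    [[0.0, 2.0], [2.0, 0.0]],
]
refined = spatio_temporal_block(frames)
```

In this sketch the temporal step only ever mixes the same token index across frames, which is one way to read "interactions at the body token level"; a real model would add learned projections and multiple heads.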

Published in Graphical Models

ISSN: 1524-0703 (Print); 1524-0711 (Online)
Publisher: Elsevier
Country of publisher: United States
LCC subjects: Science; Technology: Technology (General)
Website: https://www.sciencedirect.com/journal/graphical-models
