3D hypothesis clustering for cross-view matching in multi-person motion capture

Miaopeng Li; Zimeng Zhou; Xinguo Liu

doi:10.1007/s41095-020-0171-y

Computational Visual Media (Jun 2020)

3D hypothesis clustering for cross-view matching in multi-person motion capture

Miaopeng Li,
Zimeng Zhou,
Xinguo Liu

Affiliations

Miaopeng Li: State Key Lab of CAD&CG, Zhejiang University
Zimeng Zhou: State Key Lab of CAD&CG, Zhejiang University
Xinguo Liu: State Key Lab of CAD&CG, Zhejiang University

DOI: https://doi.org/10.1007/s41095-020-0171-y
Journal volume & issue: Vol. 6, no. 2
pp. 147 – 156

Abstract

Read online

Abstract We present a multiview method for markerless motion capture of multiple people. The main challenge in this problem is to determine cross-view correspondences for the 2D joints in the presence of noise. We propose a 3D hypothesis clustering technique to solve this problem. The core idea is to transform joint matching in 2D space into a clustering problem in a 3D hypothesis space. In this way, evidence from photometric appearance, multiview geometry, and bone length can be integrated to solve the clustering problem efficiently and robustly. Each cluster encodes a set of matched 2D joints for the same person across different views, from which the 3D joints can be effectively inferred. We then assemble the inferred 3D joints to form full-body skeletons for all persons in a bottom–up way. Our experiments demonstrate the robustness of our approach even in challenging cases with heavy occlusion, closely interacting people, and few cameras. We have evaluated our method on many datasets, and our results show that it has significantly lower estimation errors than many state-of-the-art methods.

Published in Computational Visual Media

ISSN: 2096-0433 (Print); 2096-0662 (Online)
Publisher: SpringerOpen
Country of publisher: China
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://www.springer.com/41095

About the journal

Abstract

Keywords