3D driver pose estimation based on joint 2D–3D network

Zhijie Yao; Yazhou Liu; Zexuan Ji; Quansen Sun; Pongsak Lasang; Shengmei Shen

doi:10.1049/iet-cvi.2019.0089

IET Computer Vision (Apr 2020)

3D driver pose estimation based on joint 2D–3D network

Zhijie Yao,
Yazhou Liu,
Zexuan Ji,
Quansen Sun,
Pongsak Lasang,
Shengmei Shen

Affiliations

Zhijie Yao: School of Computer Science and Engineering, Nanjing University of Science and TechnologyNanjing210094People's Republic of China
Yazhou Liu: School of Computer Science and Engineering, Nanjing University of Science and TechnologyNanjing210094People's Republic of China
Zexuan Ji: School of Computer Science and Engineering, Nanjing University of Science and TechnologyNanjing210094People's Republic of China
Quansen Sun: School of Computer Science and Engineering, Nanjing University of Science and TechnologyNanjing210094People's Republic of China
Pongsak Lasang: Panasonic Research and Development CenterSingaporeRepublic of Singapore
Shengmei Shen: Panasonic Research and Development CenterSingaporeRepublic of Singapore

DOI: https://doi.org/10.1049/iet-cvi.2019.0089
Journal volume & issue: Vol. 14, no. 3
pp. 84 – 91

Abstract

Read online

Three‐dimensional (3D) driver pose estimation is a promising and challenging problem for computer–human interaction. Recently convolutional neural networks have been introduced into 3D pose estimation, but these methods have the problem of slow running speed and are not suitable for driving scenario. In this study, the proposed method is based on two types of inputs, infrared image and point cloud obtained from time‐of‐flight camera. The authors propose a joint 2D–3D network incorporating image‐based and point‐based feature to promote the performance of 3D human pose estimation and run on a high speed. For point cloud with invalid points, the authors first do preprocess and then design a denoising module to handle this problem. Experiments on private driver data set and public Invariant‐Top View data set show that the proposed method achieves efficient and competitive performance on 3D human pose estimation.

Published in IET Computer Vision

ISSN: 1751-9632 (Print); 1751-9640 (Online)
Publisher: Wiley
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Science: Mathematics: Instruments and machines: Electronic computers. Computer science: Computer software
Website: https://ietresearch.onlinelibrary.wiley.com/journal/17519640

About the journal

Abstract

Keywords