EdgePose: Real-Time Human Pose Estimation Scheme for Industrial Scenes

Lei Zhang; Weifang Huang; Jiachun Zheng; Chaopeng Li; Xinyi Wu; Bingjie Xiang; Jiawen Yang; Deyin Xu

doi:10.1109/ACCESS.2024.3446247

IEEE Access (Jan 2024)

Affiliations

Lei Zhang: College of Ocean Information Engineering, Jimei University, Xiamen, China
Weifang Huang: College of Ocean Information Engineering, Jimei University, Xiamen, China
Jiachun Zheng: College of Ocean Information Engineering, Jimei University, Xiamen, China
Chaopeng Li: College of Ocean Information Engineering, Jimei University, Xiamen, China
Xinyi Wu: College of Ocean Information Engineering, Jimei University, Xiamen, China
Bingjie Xiang: College of Ocean Information Engineering, Jimei University, Xiamen, China
Jiawen Yang: College of Ocean Information Engineering, Jimei University, Xiamen, China
Deyin Xu: College of Ocean Information Engineering, Jimei University, Xiamen, China

DOI: https://doi.org/10.1109/ACCESS.2024.3446247
Journal volume & issue: Vol. 12, pp. 156702–156716

Abstract

Common human pose estimation methods rely on 2D heatmap regression, which requires expensive upsampling layers to maintain heatmap resolution and additional post-processing to decode coordinates. These components slow down inference. To address this challenge, we propose a new real-time human pose estimation framework, EdgePose. First, we design the convolutional module EdgeBlock-C and the edge attention module EdgeBlock-T, and build a hybrid network from them that combines the advantages of ConvNets and ViT. In addition, EdgePose simplifies the pose estimation pipeline by recasting the output of keypoint coordinates as a pixel classification task along the horizontal and vertical axes. This eliminates the upsampling and post-processing operations and speeds up inference while preserving accuracy. Experimental results show that EdgePose-S achieves an AP of 68.6 on the COCO validation set while running at 285.7 FPS on an Intel i9-10920X CPU. On the embedded Jetson Xavier NX, EdgePose-B achieves an AP of 72.2 at 51.3 FPS, outperforming existing two-stage pose estimation methods.
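
The axis-wise classification idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `decode_axis_classification`, the `split_ratio` parameter (bins per input pixel), and the confidence rule are assumptions made for illustration only. Each keypoint gets one vector of class scores over horizontal bins and one over vertical bins, and the coordinate is read off with an argmax per axis, so no 2D heatmap has to be upsampled or decoded.

```python
import numpy as np


def softmax(z):
    """Numerically stable softmax over the last axis."""
    e = np.exp(z - z.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)


def decode_axis_classification(x_logits, y_logits, split_ratio=2.0):
    """Decode keypoints from per-axis classification scores.

    x_logits: (K, W_bins) scores over horizontal bins for K keypoints.
    y_logits: (K, H_bins) scores over vertical bins for K keypoints.
    split_ratio: hypothetical number of bins per input pixel.

    Returns (coords, conf): coords is (K, 2) in input-image pixels,
    conf is (K,), the smaller of the two per-axis peak probabilities
    (an illustrative choice, not taken from the paper).
    """
    # One argmax per axis replaces the 2D heatmap peak search.
    x_bins = np.argmax(x_logits, axis=-1)
    y_bins = np.argmax(y_logits, axis=-1)

    # Map bin indices back to pixel coordinates.
    coords = np.stack([x_bins, y_bins], axis=-1).astype(np.float64) / split_ratio

    # Per-axis peak probability, combined conservatively.
    conf = np.minimum(softmax(x_logits).max(axis=-1),
                      softmax(y_logits).max(axis=-1))
    return coords, conf
```

With `split_ratio` greater than 1 the bins are finer than one pixel, which is one way such a scheme could keep sub-pixel precision without a high-resolution heatmap. Whether EdgePose uses this particular setting is not stated in the abstract.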

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639
