Sensors (Nov 2021)

An Efficient Approach Using Knowledge Distillation Methods to Stabilize Performance in a Lightweight Top-Down Posture Estimation Network

  • Changhyun Park,
  • Hean Sung Lee,
  • Woo Jin Kim,
  • Han Byeol Bae,
  • Jaeho Lee,
  • Sangyoun Lee

DOI
https://doi.org/10.3390/s21227640
Journal volume & issue
Vol. 21, no. 22
p. 7640

Abstract

Multi-person pose estimation has been gaining considerable interest due to its use in several real-world applications, such as activity recognition, motion capture, and augmented reality. Although improving the accuracy and speed of multi-person pose estimation techniques has been recently studied, limitations still exist in balancing these two aspects. In this paper, a novel knowledge-distilled lightweight top-down pose network (KDLPN) is proposed that balances computational complexity and accuracy. For the first time in multi-person pose estimation, a network is presented that reduces computational complexity by applying a "Pelee" structure and by shuffling pixels in the dense upsampling convolution layer to reduce the number of channels. Furthermore, to prevent performance degradation caused by the reduced computational complexity, knowledge distillation is applied, with a full pose estimation network serving as the teacher network. The method's performance is evaluated on the MSCOCO dataset. Experimental results demonstrate that our KDLPN reduces the number of parameters by 95% relative to state-of-the-art methods with minimal performance degradation. Moreover, our method is compared with other pose estimation methods to substantiate the importance of reducing computational complexity and to demonstrate its effectiveness.
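The abstract names two techniques: pixel shuffling in the dense upsampling convolution layer to cut channel count, and knowledge distillation from a teacher pose network to recover accuracy. The sketch below illustrates both ideas in PyTorch under stated assumptions; the module structure, joint count, feature sizes, and the loss weighting `alpha` are illustrative choices, not the paper's actual implementation.

```python
# Minimal sketch, assuming PyTorch: (1) pixel shuffle as a cheap upsampling
# step that trades channels for spatial resolution, and (2) a knowledge-
# distillation loss in which a large teacher's heatmaps supervise a
# lightweight student. Names and hyperparameters are assumptions.

import torch
import torch.nn as nn
import torch.nn.functional as F

class ShuffleUpsampleHead(nn.Module):
    """Dense-upsampling-style head: a 1x1 conv produces K*r*r channels,
    then PixelShuffle rearranges them into K heatmaps at r-times the
    spatial resolution, avoiding costly deconvolution layers."""
    def __init__(self, in_ch: int, num_joints: int = 17, scale: int = 4):
        super().__init__()
        self.conv = nn.Conv2d(in_ch, num_joints * scale * scale, kernel_size=1)
        self.shuffle = nn.PixelShuffle(scale)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.shuffle(self.conv(x))  # (B, K, H*r, W*r)

def distillation_loss(student_hm, teacher_hm, target_hm, alpha: float = 0.5):
    """Blend of ground-truth supervision and teacher mimicry on heatmaps;
    alpha is an assumed weighting hyperparameter."""
    hard = F.mse_loss(student_hm, target_hm)   # ground-truth heatmaps
    soft = F.mse_loss(student_hm, teacher_hm)  # frozen teacher's predictions
    return alpha * hard + (1.0 - alpha) * soft

# Usage sketch: a lightweight backbone's 256-channel feature map is
# upsampled to 17 joint heatmaps and trained against a frozen teacher.
head = ShuffleUpsampleHead(in_ch=256)
feat = torch.randn(2, 256, 16, 12)
student_hm = head(feat)                        # shape: (2, 17, 64, 48)
teacher_hm = torch.randn_like(student_hm)      # stand-in for teacher output
target_hm = torch.randn_like(student_hm)       # stand-in for GT heatmaps
loss = distillation_loss(student_hm, teacher_hm, target_hm)
loss.backward()
```

The design intuition is that PixelShuffle replaces learned deconvolutions with a parameter-free rearrangement after a cheap 1x1 convolution, while the soft teacher term compensates for the capacity lost to that simplification.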

Keywords