Adaptive temporal compression for reduction of computational complexity in human behavior recognition

Haixin Huang; Yuyao Wang; Mingqi Cai; Ruipeng Wang; Feng Wen; Xiaojie Hu

doi:10.1038/s41598-024-61286-x

Scientific Reports (May 2024)

Adaptive temporal compression for reduction of computational complexity in human behavior recognition

Haixin Huang,
Yuyao Wang,
Mingqi Cai,
Ruipeng Wang,
Feng Wen,
Xiaojie Hu

Affiliations

Haixin Huang: School of Automation and Electrical Engineering, Shenyang Ligong University
Yuyao Wang: School of Automation and Electrical Engineering, Shenyang Ligong University
Mingqi Cai: School of Automation and Electrical Engineering, Shenyang Ligong University
Ruipeng Wang: School of Automation and Electrical Engineering, Shenyang Ligong University
Feng Wen: School of Information Science and Engineering, Shenyang Ligong University
Xiaojie Hu: School of Information Science and Engineering, Shenyang Ligong University

DOI: https://doi.org/10.1038/s41598-024-61286-x
Journal volume & issue: Vol. 14, no. 1
pp. 1 – 11

Abstract

Read online

Abstract The research on video analytics especially in the area of human behavior recognition has become increasingly popular recently. It is widely applied in virtual reality, video surveillance, and video retrieval. With the advancement of deep learning algorithms and computer hardware, the conventional two-dimensional convolution technique for training video models has been replaced by three-dimensional convolution, which enables the extraction of spatio-temporal features. Specifically, the use of 3D convolution in human behavior recognition has been the subject of growing interest. However, the increased dimensionality has led to challenges such as the dramatic increase in the number of parameters, increased time complexity, and a strong dependence on GPUs for effective spatio-temporal feature extraction. The training speed can be considerably slow without the support of powerful GPU hardware. To address these issues, this study proposes an Adaptive Time Compression (ATC) module. Functioning as an independent component, ATC can be seamlessly integrated into existing architectures and achieves data compression by eliminating redundant frames within video data. The ATC module effectively reduces GPU computing load and time complexity with negligible loss of accuracy, thereby facilitating real-time human behavior recognition.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal

Abstract

Keywords