Lightweight multi-stage temporal inference network for video crowd counting

Wei Gao; Wei Gao; Rui Feng; Xiaochun Sheng

doi:10.3389/fphy.2024.1489245

Frontiers in Physics (Nov 2024)

Lightweight multi-stage temporal inference network for video crowd counting

Wei Gao,
Wei Gao,
Rui Feng,
Xiaochun Sheng

Affiliations

Wei Gao: School of Educational Science, Yangzhou University, Yangzhou, China
Wei Gao: School of Computer Engineering, Jiangsu University of Technology, Changzhou, China
Rui Feng: School of Journalism and Communication, Yangzhou University, Yangzhou, China
Xiaochun Sheng: School of Computer Engineering, Jiangsu University of Technology, Changzhou, China

DOI: https://doi.org/10.3389/fphy.2024.1489245
Journal volume & issue: Vol. 12

Abstract

Read online

Crowd density is an important metric for preventing excessive crowding in a particular area, but it still faces challenges such as perspective distortion, scale variation, and pedestrian occlusion. Existing studies have attempted to model the spatio-temporal dependencies in videos using LSTM and 3D CNNs. However, these methods suffer from large computational costs, excessive parameter redundancy, and loss of temporal information, leading to difficulties in model convergence and limited recognition performance. To address these issues, we propose a lightweight multi-stage temporal inference network (LMSTIN) for video crowd counting. LMSTIN effectively models the spatio-temporal dependencies in video sequences at a fine-grained level, enabling real-time and accurate video crowd counting. Our proposed method achieves significant performance improvements on three public crowd counting datasets.

Published in Frontiers in Physics

ISSN: 2296-424X (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Science: Physics
Website: https://www.frontiersin.org/journals/physics

About the journal

Abstract

Keywords