Cognitive Video Surveillance Management in Hierarchical Edge Computing System with Long Short-Term Memory Model

Dilshod Bazarov Ravshan Ugli; Jingyeom Kim; Alaelddin F. Y. Mohammed; Joohyung Lee

doi:10.3390/s23052869

Sensors (Mar 2023)

Cognitive Video Surveillance Management in Hierarchical Edge Computing System with Long Short-Term Memory Model

Dilshod Bazarov Ravshan Ugli,
Jingyeom Kim,
Alaelddin F. Y. Mohammed,
Joohyung Lee

Affiliations

Dilshod Bazarov Ravshan Ugli: Department of Computing, Gachon University, Seongnam-si 13120, Republic of Korea
Jingyeom Kim: Advanced Research Team, NHN Cloud Corp., Seongnam-si 13487, Republic of Korea
Alaelddin F. Y. Mohammed: Department of Computing, Gachon University, Seongnam-si 13120, Republic of Korea
Joohyung Lee: Department of Computing, Gachon University, Seongnam-si 13120, Republic of Korea

DOI: https://doi.org/10.3390/s23052869
Journal volume & issue: Vol. 23, no. 5
p. 2869

Abstract

Read online

Nowadays, deep learning (DL)-based video surveillance services are widely used in smart cities because of their ability to accurately identify and track objects, such as vehicles and pedestrians, in real time. This allows a more efficient traffic management and improved public safety. However, DL-based video surveillance services that require object movement and motion tracking (e.g., for detecting abnormal object behaviors) can consume a substantial amount of computing and memory capacity, such as (i) GPU computing resources for model inference and (ii) GPU memory resources for model loading. This paper presents a novel cognitive video surveillance management with long short-term memory (LSTM) model, denoted as the CogVSM framework. We consider DL-based video surveillance services in a hierarchical edge computing system. The proposed CogVSM forecasts object appearance patterns and smooths out the forecast results needed for an adaptive model release. Here, we aim to reduce standby GPU memory by model release while avoiding unnecessary model reloads for a sudden object appearance. CogVSM hinges on an LSTM-based deep learning architecture explicitly designed for future object appearance pattern prediction by training previous time-series patterns to achieve these objectives. By referring to the result of the LSTM-based prediction, the proposed framework controls the threshold time value in a dynamic manner by using an exponential weighted moving average (EWMA) technique. Comparative evaluations on both simulated and real-world measurement data on the commercial edge devices prove that the LSTM-based model in the CogVSM can achieve a high predictive accuracy, i.e., a root-mean-square error metric of 0.795. In addition, the suggested framework utilizes up to 32.1% less GPU memory than the baseline and 8.9% less than previous work.

Published in Sensors

ISSN: 1424-8220 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Chemical technology
Website: http://www.mdpi.com/journal/sensors

About the journal

Abstract

Keywords