IEEE Access (Jan 2023)

MSF-NET: Foreground Objects Detection With Fusion of Motion and Semantic Features

  • Jae-Yeul Kim,
  • Jong-Eun Ha

DOI
https://doi.org/10.1109/ACCESS.2023.3345842
Journal volume & issue
Vol. 11
pp. 145551 – 145565

Abstract

Visual surveillance requires robust detection of foreground objects under challenging conditions such as abrupt lighting variation, stationary foreground objects, dynamic background objects, and severe weather. Most classical algorithms rely on background model images produced by statistical modeling of the change in brightness values over time. Because they have difficulty using global features, they produce many false detections in stationary foreground regions and around dynamic background objects. Recent deep learning-based methods can capture global characteristics more easily than classical methods, but they still leave room for improvement in how they use spatiotemporal information. We propose an algorithm that uses spatiotemporal information efficiently by adopting a split-and-merge framework. First, we split the spatiotemporal information in multiple successive images into spatial and temporal parts using two sub-networks, a semantic network and a motion network. The separated information is then fused in a spatiotemporal fusion network. The proposed network thus consists of three sub-networks, and we denote it MSF-NET (Motion and Semantic features Fusion NETwork). We also propose a method to train MSF-NET stably. Compared with the latest deep learning algorithms, the proposed MSF-NET achieves 9% and 13% higher FM on the LASIESTA and SBI datasets, respectively. We also designed MSF-NET to be lightweight so that it runs in real time on a desktop GPU.
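The split-and-merge idea described in the abstract can be illustrated with a toy sketch. The cues below are deliberately simple stand-ins: the motion cue is plain frame differencing and the "semantic" cue is a gradient-magnitude map, whereas in MSF-NET both parts are learned sub-networks and the merge step is a learned spatiotemporal fusion network. All function names, weights, and thresholds here are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def motion_cue(frames):
    """Temporal part: mean absolute difference of successive frames.
    frames: (T, H, W) grayscale stack (stands in for the motion network)."""
    return np.abs(np.diff(frames, axis=0)).mean(axis=0)

def semantic_cue(frame):
    """Spatial part: toy gradient magnitude of the most recent frame
    (stands in for the semantic network's learned features)."""
    gy, gx = np.gradient(frame)
    return np.hypot(gx, gy)

def fuse(motion, semantic, w=0.5, thresh=0.2):
    """Merge step: weighted combination thresholded to a binary mask
    (stands in for the spatiotemporal fusion network)."""
    score = w * motion + (1.0 - w) * semantic
    return (score > thresh).astype(np.uint8)

# Synthetic example: a bright square moving across a dark background.
T, H, W = 5, 32, 32
frames = np.zeros((T, H, W))
for t in range(T):
    frames[t, 10:16, 5 + 3 * t : 11 + 3 * t] = 1.0

mask = fuse(motion_cue(frames), semantic_cue(frames[-1]))
print(mask.shape, int(mask.max()))  # (32, 32) 1
```

The point of the split is that each path can specialize: the temporal path responds to change (helping with dynamic backgrounds), while the spatial path responds to object appearance (helping with stationary foreground objects that frame differencing alone would miss).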

Keywords