Universal Foreground Segmentation Based on Deep Feature Fusion Network for Multi-Scene Videos

Ye Tao; Zhihao Ling; Ioannis Patras

doi:10.1109/ACCESS.2019.2950639

IEEE Access (Jan 2019)

Universal Foreground Segmentation Based on Deep Feature Fusion Network for Multi-Scene Videos

Ye Tao,
Zhihao Ling,
Ioannis Patras

Affiliations

Ye Tao: ORCiD; Key Laboratory of Advanced Control and Optimization for Chemical Processes, Ministry of Education, East China University of Science and Technology, Shanghai, China
Zhihao Ling: Key Laboratory of Advanced Control and Optimization for Chemical Processes, Ministry of Education, East China University of Science and Technology, Shanghai, China
Ioannis Patras: School of Electronic Engineering and Computer Science, Queen Mary University of London, London, U.K.

DOI: https://doi.org/10.1109/ACCESS.2019.2950639
Journal volume & issue: Vol. 7
pp. 158326 – 158337

Abstract

Read online

Foreground/background (fg/bg) classification is an important first step for several video analysis tasks such as people counting, activity recognition and anomaly detection. As is the case for several other Computer Vision problems, the advent of deep Convolutional Neural Network (CNN) methods has led to major improvements in this field. However, despite their success, CNN-based methods have difficulties in coping with multi-scene videos where the scenes change multiple times along the time sequence. In this paper, we propose a deep features fusion network based foreground segmentation method (DFFnetSeg), which is both robust to scene changes and unseen scenes comparing with competitive state-of-the-art methods. In the heart of DFFnetSeg lies a fusion network that takes as input deep features extracted from a current frame, a previous frame, and a reference frame and produces as output a segmentation mask into background and foreground objects. We show the advantages of using a fusion network and the three frames group in dealing with the unseen scene and bootstrap challenge. In addition, we show that a simple reference frame updating strategy enables DFFnetSeg to be robust to sudden scene changes inside video sequences and prepare a motion map based post-processing method which further reduces false positives. Experimental results on the test dataset generated from CDnet2014 and Lasiesta demonstrate the advantages of the DFFnetSeg method.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords