Video object segmentation via couple streams and feature memory

Yun Liang; Xinjie Xiao; Shaojian Qiu; Yuqing Zhang; Zhuo Su

doi:10.1049/ipr2.13051

IET Image Processing (Jul 2024)

Affiliations

Yun Liang: College of Mathematics and Informatics, South China Agricultural University, Guangzhou, China
Xinjie Xiao: College of Mathematics and Informatics, South China Agricultural University, Guangzhou, China
Shaojian Qiu: College of Mathematics and Informatics, South China Agricultural University, Guangzhou, China
Yuqing Zhang: School of Control Science and Engineering, Beijing University of Technology, Beijing, China
Zhuo Su: School of Control Science and Engineering, Sun Yat‐sen University, Guangzhou, China

DOI: https://doi.org/10.1049/ipr2.13051
Journal volume & issue: Vol. 18, no. 9
pp. 2257 – 2272

Abstract

In recent years, most video segmentation methods have used deep CNNs to process the input image, but they have not fully mined the rich intermediate predictions in spatio‐temporal space. Moreover, segmentation challenges such as occlusion, severe deformation and illumination variation have not yet been well solved. To alleviate these problems, this paper focuses on constructing multi‐module network structures that represent multiple semantics, and proposes a video object segmentation network based on a coupled‐stream architecture with a feature memory mechanism. The network first extracts high‐level semantic features, edge features, and long‐term and short‐term stable depth features of the target, and then decodes them into the segmentation mask of the target. In addition, negative skeleton inhibition and frame interpolation are used to prevent interference from similar objects and from motion blur, respectively. The method has low GPU memory usage, regardless of the number of objects in the video, and achieves 86.5% and 62.4% in the J&F measure on the DAVIS 2016 and DAVIS 2017 validation sets, respectively, without fine‐tuning or online training.

Published in IET Image Processing

ISSN: 1751-9659 (Print); 1751-9667 (Online)
Publisher: Wiley
Country of publisher: United Kingdom
LCC subjects: Technology: Photography; Science: Mathematics: Instruments and machines: Electronic computers. Computer science: Computer software
Website: https://ietresearch.onlinelibrary.wiley.com/journal/17519667
