IET Image Processing (Jul 2024)

Video object segmentation via couple streams and feature memory

  • Yun Liang,
  • Xinjie Xiao,
  • Shaojian Qiu,
  • Yuqing Zhang,
  • Zhuo Su

DOI
https://doi.org/10.1049/ipr2.13051
Journal volume & issue
Vol. 18, no. 9
pp. 2257 – 2272

Abstract

Read online

Abstract In recent years, most video segmentation methods use deep CNN to process the input image, but they did not fully mine the rich intermediate predictions in spatio‐temporal space. And, the segmentation challenges such as occlusion, severe deformation and illumination have not been well solved so far. To alleviate these problems, this paper focuses on constructing multi module network structures that represent multi semantics and proposes a video object segmentation network via coupled‐stream architecture with feature memory mechanism. This network first extracts high‐level semantic features, edge features, long‐term and short‐term stable depth features of the target, and then decode them into the segmentation mask of target. In addition, negative skeleton inhibition and frame interpolation are used to prevent the interference of similar objects and motion blur, respectively. The method has a low GPU memory usage, regardless of the number of object in video. And performs 86.5%and 62.4% in J&F measure on DAVIS 2016 and DAVIS 2017 validation set, without fine‐tuning and online training.

Keywords