IET Computer Vision (Jun 2024)

A temporal shift reconstruction network for compressive video sensing

  • Zhenfei Gu,
  • Chao Zhou,
  • Guofeng Lin

DOI
https://doi.org/10.1049/cvi2.12234
Journal volume & issue
Vol. 18, no. 4
pp. 448 – 457

Abstract

Compressive sensing provides a promising sampling paradigm for video acquisition in resource‐limited sensor applications. However, reconstructing the original video signal from sub‐sampled measurements remains a great challenge. To exploit the temporal redundancies within videos during recovery, previous works tend to perform alignment on initial reconstructions, which are too coarse to provide accurate motion estimates. To solve this problem, the authors propose a novel reconstruction network, named TSRN, for compressive video sensing. Specifically, the authors utilise a number of stacked temporal shift reconstruction blocks (TSRBs) to enhance the initial reconstruction progressively. Each TSRB learns temporal structures by exchanging information with the previous and next time steps and, owing to the high efficiency of temporal shift operations, imposes no additional computation on the network compared to regular 2D convolutions. After this enhancement, a bidirectional alignment module is employed to build accurate temporal dependencies directly with the help of optical flow. Unlike previous methods that extract supplementary information only from the key frames, the proposed alignment module receives temporal information from the whole video sequence via bidirectional propagation, thus yielding better performance. Experimental results verify the superiority of the proposed method over other state‐of‐the‐art approaches, both quantitatively and qualitatively.
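The zero-cost channel exchange that the abstract attributes to the TSRBs can be illustrated with a minimal NumPy sketch of a temporal shift operation. This is not the paper's implementation; the channel layout, the `shift_div` hyper-parameter, and the zero-padding at sequence boundaries are illustrative assumptions.

```python
import numpy as np

def temporal_shift(x, shift_div=8):
    """Illustrative temporal shift over a video feature tensor.

    A fraction (1/shift_div) of the channels is shifted toward the
    previous time step, an equal fraction toward the next time step,
    and the remaining channels are left in place, so each frame can
    exchange information with its neighbours at no extra FLOP cost
    beyond plain memory movement.

    x: array of shape (T, C, H, W); shift_div is a hypothetical
    hyper-parameter, not taken from the paper.
    """
    T, C, H, W = x.shape
    fold = C // shift_div
    out = np.zeros_like(x)
    out[:-1, :fold] = x[1:, :fold]                 # pull features from the next frame
    out[1:, fold:2 * fold] = x[:-1, fold:2 * fold]  # pull features from the previous frame
    out[:, 2 * fold:] = x[:, 2 * fold:]             # untouched channels
    return out
```

Because the shift is pure indexing, a regular 2D convolution applied after it can mix information across adjacent frames without any 3D convolution, which is the efficiency argument the abstract makes.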

Keywords