IEEE Access (Jan 2022)

Video Sparse Transformer With Attention-Guided Memory for Video Object Detection

  • Masato Fujitake,
  • Akihiro Sugimoto

DOI
https://doi.org/10.1109/ACCESS.2022.3184031
Journal volume & issue
Vol. 10
pp. 65886–65900

Abstract

Detecting objects in a video, known as Video Object Detection (VOD), is challenging because the appearance of objects changes over time, which can cause detection errors. Recent research has focused on aggregating features from adjacent frames to compensate for the deteriorated appearance of a frame. Using distant frames has also been proposed to cope with appearance deterioration that persists over several frames. Since an object's position may change significantly in a distant frame, such methods use only features of object candidate regions, which are independent of position. However, these methods rely on the detection performance of the object candidate regions themselves and are therefore not practical when appearances are severely deteriorated. In this paper, we propose the Video Sparse Transformer with Attention-guided Memory (VSTAM), which enhances features element-wise before object candidate region detection. Furthermore, we aggregate element-wise features sparsely to reduce processing time and memory cost. In addition, we introduce an external memory update strategy based on how the aggregation is utilized, so that long-term information is retained effectively. Our method achieves accuracy gains of 8.3% and 11.1% over the baseline on the ImageNet VID and UA-DETRAC datasets, and demonstrates superior performance against state-of-the-art results on widely used VOD datasets.
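To make the abstract's ideas concrete, the following is a minimal, hypothetical sketch of sparse cross-frame feature aggregation combined with an external memory whose entries are kept or evicted according to how much attention they receive. The tensor shapes, the random sparse sampling, the residual aggregation, and the utilization-based update rule are assumptions chosen for illustration; they are not taken from the paper and do not reproduce VSTAM's actual architecture.

```python
# Hypothetical sketch: sparse feature aggregation + attention-utilization memory update.
# All shapes, ratios, and the selection/update rules are illustrative assumptions.
import torch
import torch.nn.functional as F


def sparse_aggregate(cur_tokens, support_tokens, memory_tokens, sample_ratio=0.25):
    """Enhance current-frame tokens by attending over a sparse subset of
    support-frame tokens plus external memory tokens.

    cur_tokens:     (N, C) flattened feature map of the current frame
    support_tokens: (M, C) flattened features from adjacent / distant frames
    memory_tokens:  (K, C) external memory entries
    Returns enhanced tokens (N, C) and per-memory-entry utilization (K,).
    """
    # Sparsely sample support tokens to cut compute and memory cost
    # (random here; a learned, attention-guided selection would replace this).
    m = max(1, int(sample_ratio * support_tokens.shape[0]))
    idx = torch.randperm(support_tokens.shape[0])[:m]
    keys = torch.cat([support_tokens[idx], memory_tokens], dim=0)  # (m + K, C)

    # Scaled dot-product attention of current tokens over the sparse key set.
    scale = cur_tokens.shape[-1] ** -0.5
    attn = F.softmax(cur_tokens @ keys.T * scale, dim=-1)          # (N, m + K)
    enhanced = cur_tokens + attn @ keys                            # residual aggregation

    # Utilization of each memory entry = total attention mass it received.
    mem_util = attn[:, m:].sum(dim=0)                              # (K,)
    return enhanced, mem_util


def update_memory(memory_tokens, mem_util, new_tokens, keep=48):
    """Keep the most-utilized memory entries and append new candidates,
    so frequently used long-term information survives across frames."""
    keep = min(keep, memory_tokens.shape[0])
    top = torch.topk(mem_util, keep).indices
    return torch.cat([memory_tokens[top], new_tokens], dim=0)


if __name__ == "__main__":
    cur = torch.randn(196, 256)          # e.g. a 14x14 feature map with 256 channels
    support = torch.randn(3 * 196, 256)  # features from three support frames
    memory = torch.randn(64, 256)        # external memory entries
    enhanced, util = sparse_aggregate(cur, support, memory)
    memory = update_memory(memory, util, enhanced[:16])
    print(enhanced.shape, memory.shape)
```

Because the aggregation operates on element-wise (per-location) features rather than on detected candidate boxes, the enhancement happens before region proposals are formed, which is the ordering the abstract emphasizes.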

Keywords