Exploring Global Diversity and Local Context for Video Summarization

Yingchao Pan; Ouhan Huang; Qinghao Ye; Zhongjin Li; Wenjiang Wang; Guodun Li; Yuxing Chen

doi:10.1109/ACCESS.2022.3163414

IEEE Access (Jan 2022)

Exploring Global Diversity and Local Context for Video Summarization

Yingchao Pan,
Ouhan Huang,
Qinghao Ye,
Zhongjin Li,
Wenjiang Wang,
Guodun Li,
Yuxing Chen

Affiliations

Yingchao Pan: School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou, China
Ouhan Huang: ORCiD; School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou, China
Qinghao Ye: ORCiD; School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou, China
Zhongjin Li: ORCiD; School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou, China
Wenjiang Wang: School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou, China
Guodun Li: School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou, China
Yuxing Chen: School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou, China

DOI: https://doi.org/10.1109/ACCESS.2022.3163414
Journal volume & issue: Vol. 10
pp. 43611 – 43622

Abstract

Read online

Video summarization aims to automatically generate a diverse and concise summary which is useful in large-scale video processing. Most of the methods tend to adopt self-attention mechanism across video frames, which fails to model the diversity of video frames. To alleviate this problem, we revisit the pairwise similarity measurement in self-attention mechanism and find that the existing inner-product affinity leads to discriminative features rather than diversified features. In light of this phenomenon, we propose global diverse attention which uses the squared Euclidean distance instead to compute the affinities. Moreover, we model the local contextual information by novel local contextual attention to remove the redundancy in the video. By combining these two attention mechanisms, a video SUMmarization model with Diversified Contextual Attention scheme is developed, namely SUM-DCA. Extensive experiments are conducted on benchmark data sets to verify the effectiveness and the superiority of SUM-DCA in terms of F-score and rank-based evaluation without any bells and whistles.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords