Intelligent and Converged Networks (Jun 2024)
JudPriNet: Video transition detection based on semantic relationship and Monte Carlo sampling
Abstract
Video understanding and content boundary detection are vital stages in video recommendation. However, previous content boundary detection methods require collecting information such as location, cast, action, and audio, and if any of these elements is missing, the results may be adversely affected. To address this issue and effectively detect transitions in video content, we introduce a video classification and boundary detection method named JudPriNet. The method focuses on objects in videos and their labels, enabling automatic scene detection in video clips and establishing semantic connections among local objects in the images. As a significant contribution, JudPriNet presents a framework that maps labels into a "Continuous Bag of Visual Words" model to cluster them and generate new standardized labels as video-type tags, facilitating automatic classification of video clips. Furthermore, JudPriNet employs a Monte Carlo sampling method to classify video clips, treating the features of video clips as elements within the framework. The proposed method seamlessly integrates video and textual components without compromising training or inference speed. Through experiments, we demonstrate that JudPriNet, with its semantic connections, effectively classifies videos alongside textual content. Our results indicate that JudPriNet excels in high-level content detection without disrupting the integrity of the video content, outperforming several existing detection approaches.
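The pipeline summarized above (clustering label embeddings into standardized video-type tags, then classifying a clip by sampling its labels) can be illustrated with a minimal, self-contained sketch. This is an assumption-laden toy, not the paper's implementation: it stands in for the "Continuous Bag of Visual Words" clustering with a 1-D k-means over hypothetical label embeddings, and for the Monte Carlo classification with repeated random draws over a clip's frame-level labels.

```python
import random
from collections import Counter

def kmeans_1d(points, k, iters=20, seed=0):
    """Toy k-means over 1-D label embeddings (stands in for
    clustering labels into standardized video-type tags)."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)
    for _ in range(iters):
        # Assign each embedding to its nearest cluster center.
        clusters = [[] for _ in range(k)]
        for p in points:
            i = min(range(k), key=lambda c: abs(p - centers[c]))
            clusters[i].append(p)
        # Recompute centers; keep the old center if a cluster empties.
        centers = [sum(c) / len(c) if c else centers[i]
                   for i, c in enumerate(clusters)]
    return sorted(centers)

def classify_clip(frame_labels, n_samples=200, seed=0):
    """Monte Carlo estimate of a clip's dominant label: sample
    frame-level labels with replacement and take the majority."""
    rng = random.Random(seed)
    draws = [rng.choice(frame_labels) for _ in range(n_samples)]
    return Counter(draws).most_common(1)[0][0]
```

For example, embeddings `[0.1, 0.2, 0.15, 5.0, 5.1, 4.9]` separate into two tag clusters near 0.15 and 5.0, and a clip whose frames are mostly labeled "action" is classified as "action" with high probability.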
Keywords