Scientific Reports (Mar 2024)

Sentiment analysis of video danmakus based on MIBE-RoBERTa-FF-BiLSTM

  • Jianbo Zhao,
  • Huailiang Liu,
  • Yakai Wang,
  • Weili Zhang,
  • Xiaojin Zhang,
  • Bowei Li,
  • Tong Sun,
  • Yanwei Qi,
  • Shanzhuang Zhang

DOI
https://doi.org/10.1038/s41598-024-56518-z
Journal volume & issue
Vol. 14, no. 1
pp. 1 – 16

Abstract

Danmakus are user-generated comments overlaid on videos, enabling real-time interaction between viewers and video content. The emotional orientation of danmakus reflects viewers' attitudes and opinions toward video segments, which can help video platforms optimize content recommendation and assess users' abnormal emotion levels. To address the low transferability of traditional sentiment analysis methods to the danmaku domain, the low accuracy of danmaku text segmentation, the poor consistency of sentiment annotation, and insufficient semantic feature extraction, this paper proposes a video danmaku sentiment analysis method based on MIBE-RoBERTa-FF-BiLSTM. We construct a "Bilibili Must-Watch List and Top Video Danmaku Sentiment Dataset" covering 10,000 positive and negative danmaku texts across 18 themes. A new-word recognition algorithm based on mutual information (MI) and branch entropy (BE) discovers 2610 irregular, popular online neologisms, from trigrams to heptagrams, in the dataset, forming a domain lexicon. Maslow's hierarchy of needs is applied to guide consistent sentiment annotation. The domain lexicon is integrated into the feature fusion layer of the RoBERTa-FF-BiLSTM model so that the model fully learns word-level, character-level, and contextual semantic features of danmaku texts before performing sentiment classification. Comparative experiments on the dataset show that the proposed model achieves the best overall performance among mainstream models for video danmaku sentiment classification, with an F1 score of 94.06%, and its accuracy and robustness also exceed those of the other models. The limitations of this work are that constructing the domain lexicon still requires manual participation and review, and that the semantic information of the video content itself and the positive-class preference are not addressed.
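The MI-plus-branch-entropy idea behind the new-word discovery step can be illustrated with a minimal sketch. This is not the paper's implementation: for simplicity it scores only character bigrams (the paper goes up to heptagrams), and the function name and the `mi_min`/`be_min` thresholds are illustrative assumptions.

```python
from collections import Counter, defaultdict
from math import log2

def new_word_candidates(corpus, mi_min=3.0, be_min=1.0):
    """Sketch of MI + branch-entropy new-word discovery on character
    bigrams. Thresholds and the bigram restriction are illustrative."""
    n_chars = len(corpus)
    chars = Counter(corpus)
    bigrams = Counter(corpus[i:i + 2] for i in range(n_chars - 1))
    # Collect the distribution of characters adjacent to each bigram.
    left, right = defaultdict(Counter), defaultdict(Counter)
    for i in range(n_chars - 1):
        g = corpus[i:i + 2]
        if i > 0:
            left[g][corpus[i - 1]] += 1
        if i + 2 < n_chars:
            right[g][corpus[i + 2]] += 1

    def entropy(ctr):
        s = sum(ctr.values())
        return -sum(c / s * log2(c / s) for c in ctr.values()) if s else 0.0

    candidates = []
    for g, f in bigrams.items():
        # Pointwise mutual information: high MI means the two characters
        # co-occur far more often than chance, i.e. strong internal cohesion.
        mi = log2((f / (n_chars - 1)) /
                  ((chars[g[0]] / n_chars) * (chars[g[1]] / n_chars)))
        # Branch entropy: high entropy on BOTH sides means the neighbors
        # vary freely, suggesting the candidate is a free-standing word.
        be = min(entropy(left[g]), entropy(right[g]))
        if mi >= mi_min and be >= be_min:
            candidates.append((g, mi, be))
    return sorted(candidates, key=lambda t: -(t[1] + t[2]))
```

On a toy corpus where the pair "xy" recurs with varying neighbors, `new_word_candidates("axyb cxyd exyf gxyh ixyj", mi_min=2.0, be_min=1.0)` surfaces "xy" as the only candidate, since every other bigram has a fixed left or right neighbor and hence zero branch entropy.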

Keywords