Enhancing sentiment analysis of online comments: a novel approach integrating topic modeling and deep learning

Yan Zhu; Rui Zhou; Gang Chen; Baili Zhang

doi:10.7717/peerj-cs.2542

PeerJ Computer Science (Dec 2024)

Enhancing sentiment analysis of online comments: a novel approach integrating topic modeling and deep learning

Yan Zhu,
Rui Zhou,
Gang Chen,
Baili Zhang

Affiliations

Yan Zhu: School of Design and Art, Shanghai Dianji University, Shanghai, Shanghai, China
Rui Zhou: School of Design and Art, Shanghai Dianji University, Shanghai, Shanghai, China
Gang Chen: Infrastructure Technology Business Group, Ant Group, Hangzhou, Zhejiang, China
Baili Zhang: School of Computer Science and Engineering, Southeast University, Nanjing, Jiangsu, China

DOI: https://doi.org/10.7717/peerj-cs.2542
Journal volume & issue: Vol. 10
p. e2542

Abstract

Read online Read online

Traditional statistical learning-based sentiment analysis methods often struggle to effectively handle text relevance and temporality. To overcome these limitations, this paper proposes a novel approach integrating Latent Dirichlet Allocation (LDA), Shuffle-enhanced Real-Valued Non-Volume Preserving (RealNVP), a double-layer bidirectional improved Long Short-Term Memory (DBiLSTM) network, and a multi-head self-attention mechanism for sentiment analysis. LDA is employed to extract latent topics within comment texts, revealing text relevance and providing fine-grained user feedback. Shuffle enhancement is applied to RealNVP to effectively model the distribution of text topic features, enhancing performance while avoiding excessive complexity in model structure and computational overhead. The double-layer bidirectional improved LSTM, through the coupling of forget and input gates, captures the dynamic temporal changes in sentiment with greater flexibility. The multi-head self-attention mechanism enhances the model’s ability to select and focus on key information, thereby more accurately reflecting user experiences. Experimental results on both Chinese and English online comment datasets demonstrate that the proposed integrated model achieves improved topic coherence compared to traditional LDA models, effectively mitigating overfitting. Furthermore, the model outperforms single models and other baselines in sentiment classification tasks, as evidenced by superior accuracy and F1 scores. These results underscore the model’s effectiveness for both Chinese and English sentiment analysis in the context of online comments.

Published in PeerJ Computer Science

ISSN: 2376-5992 (Online)
Publisher: PeerJ Inc.
Country of publisher: United States
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://peerj.com/computer-science/

About the journal

Abstract

Keywords