Robust Sentiment Analysis Model Based on Feature Representation in Uncertainty Domain

CHEN Jie, LI Shuai, ZHAO Shu, ZHANG Yanping

doi:10.3778/j.issn.1673-9418.2305077

Jisuanji kexue yu tansuo (Dec 2023)

Robust Sentiment Analysis Model Based on Feature Representation in Uncertainty Domain

CHEN Jie, LI Shuai, ZHAO Shu, ZHANG Yanping

Affiliations

CHEN Jie, LI Shuai, ZHAO Shu, ZHANG Yanping: School of Computer Science and Technology, Anhui University, Hefei 230601, China

DOI: https://doi.org/10.3778/j.issn.1673-9418.2305077
Journal volume & issue: Vol. 17, no. 12
pp. 3020 – 3028

Abstract

Read online

In the sentiment classification of text data, there are often some fuzzy data that are difficult to classify. Due to their uncertainty, these fuzzy data appear to be over fitted during model training, which affects the robustness of the model. The three-way decision theories divide the initial sample into deterministic domains and uncertain domains, and how to select appropriate features for representation in the uncertain domain where the fuzzy data is located for downstream tasks is the challenge of the three-way decision sentiment analysis models. To address this challenge, a robust sentiment analysis model (UFR-SA) based on feature representation of three-way decision uncertainty domains is proposed. Firstly, based on the three-way decision theory, the deterministic domain and the uncertain domain are divided. For fuzzy samples in the uncertain domain, heterogeneous sample point pairs are defined to construct hierarchical features. Secondly, a hierarchical feature fusion model is designed to incorporate the advantages of each granularity feature into a multi-layer perceptual network. Finally, a divide and conquer strategy is adopted for test samples in the deterministic domain and the uncertain domain. The deterministic domain data are represented by the original features, and the fuzzy data in the uncertain domain are represented by the fused robust features. Experimental results on SST-2, SST-5, and CR datasets show that UFR-SA effectively reduces the interference of fuzzy data on the model and outperforms the performance of state-of-the-art models.

sentiment analysis; three-way decision; robustness; multi-granularity feature representation; feature fusion

Published in Jisuanji kexue yu tansuo

ISSN: 1673-9418 (Print)
Publisher: Journal of Computer Engineering and Applications Beijing Co., Ltd., Science Press
Country of publisher: China
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://fcst.ceaj.org

About the journal

Abstract

Keywords