Jisuanji kexue yu tansuo (Dec 2023)

Robust Sentiment Analysis Model Based on Feature Representation in Uncertainty Domain

  • CHEN Jie, LI Shuai, ZHAO Shu, ZHANG Yanping

DOI
https://doi.org/10.3778/j.issn.1673-9418.2305077
Journal volume & issue
Vol. 17, no. 12
pp. 3020 – 3028

Abstract

Read online

In the sentiment classification of text data, there are often some fuzzy data that are difficult to classify. Due to their uncertainty, these fuzzy data appear to be over fitted during model training, which affects the robustness of the model. The three-way decision theories divide the initial sample into deterministic domains and uncertain domains, and how to select appropriate features for representation in the uncertain domain where the fuzzy data is located for downstream tasks is the challenge of the three-way decision sentiment analysis models. To address this challenge, a robust sentiment analysis model (UFR-SA) based on feature representation of three-way decision uncertainty domains is proposed. Firstly, based on the three-way decision theory, the deterministic domain and the uncertain domain are divided. For fuzzy samples in the uncertain domain, heterogeneous sample point pairs are defined to construct hierarchical features. Secondly, a hierarchical feature fusion model is designed to incorporate the advantages of each granularity feature into a multi-layer perceptual network. Finally, a divide and conquer strategy is adopted for test samples in the deterministic domain and the uncertain domain. The deterministic domain data are represented by the original features, and the fuzzy data in the uncertain domain are represented by the fused robust features.  Experimental results on SST-2, SST-5, and CR datasets show that UFR-SA effectively reduces the interference of fuzzy data on the model and outperforms the performance of state-of-the-art models.

Keywords