CAAI Transactions on Intelligence Technology (Aug 2024)

GP‐FMLNet: A feature matrix learning network enhanced by glyph and phonetic information for Chinese sentiment analysis

  • Jing Li,
  • Dezheng Zhang,
  • Yonghong Xie,
  • Aziguli Wulamu,
  • Yao Zhang

DOI
https://doi.org/10.1049/cit2.12300
Journal volume & issue
Vol. 9, no. 4
pp. 960 – 972

Abstract

Sentiment analysis is a fine‐grained analysis task that aims to identify the sentiment polarity of a specified sentence. Existing methods for Chinese sentiment analysis consider sentiment features from only a single pole and scale, so they cannot fully exploit the available sentiment feature information and their performance is less than ideal. To resolve this problem, the authors propose a new method, GP‐FMLNet, that integrates both glyph and phonetic information, and they design a novel feature matrix learning process for phonetic features that models words sharing the same pinyin but differing in glyph. The method addresses the problem of misspelled words influencing sentiment polarity prediction. Specifically, the authors iteratively mine character, glyph, and pinyin features from the input comment sentences. They then use soft attention and matrix compound modules to model the phonetic features, which lets the model keep focusing on context‐relevant words at various positions while discounting the influence of misleading ones. Experiments on six public datasets show that the proposed model makes full use of the glyph and phonetic information and improves on the performance of existing Chinese sentiment analysis algorithms.
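
The soft attention step mentioned in the abstract can be illustrated with a minimal sketch. This is not the GP‐FMLNet architecture, which the abstract does not specify in detail; it only shows the generic idea of scoring each token's feature vector against a query, normalising the scores with a softmax, and taking the weighted sum. The function names, the dot‐product scoring, and the toy two‐dimensional "phonetic" vectors are all assumptions made for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def soft_attention(features, query):
    """Generic dot-product soft attention (illustrative, not the paper's module).

    features: list of per-token feature vectors (lists of floats)
    query:    a vector of the same dimension as each feature vector
    Returns the attention weights and the attention-weighted sum of features.
    """
    # Score each token by its dot product with the query.
    scores = [sum(f_d * q_d for f_d, q_d in zip(f, query)) for f in features]
    weights = softmax(scores)
    dim = len(query)
    # Weighted sum over tokens, one output value per feature dimension.
    pooled = [sum(w * f[d] for w, f in zip(weights, features)) for d in range(dim)]
    return weights, pooled


# Hypothetical phonetic feature vectors for three tokens (made-up values).
features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
query = [2.0, 0.0]  # hypothetical query emphasising the first dimension

weights, pooled = soft_attention(features, query)
```

In this toy input, the first and third tokens align with the query and receive equal, larger weights than the second token, which is the sense in which soft attention emphasises some positions and discounts others without discarding any of them outright.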

Keywords