IEEE Access (Jan 2020)
Feature-Enhanced Nonequilibrium Bidirectional Long Short-Term Memory Model for Chinese Text Classification
Abstract
This article proposes a model for Chinese text classification based on a feature-enhanced nonequilibrium bidirectional long short-term memory (Bi-LSTM) network that analyzes Chinese text information in depth and improves the accuracy of text classification. First, the bidirectional encoder representations from transformers model was used to vectorize the original Chinese corpus and extract preliminary semantic features. Then, a nonequilibrium Bi-LSTM network was applied to increase the weight of text information containing important semantics and further improve the effects of the key features in Chinese text classification. Simultaneously, a hierarchical attention mechanism was used to widen the gap between the important and unimportant data. Finally, the softmax function was used for classification. By comparing the classification performance of the proposed scheme with those of various other models, it was observed that the model substantially improved the precision of Chinese text classification and had a strong ability to recognize Chinese text features. The model achieved 97% precision on the experimental dataset.
Keywords