Applied Sciences (May 2024)

RQ-OSPTrans: A Semantic Classification Method Based on Transformer That Combines Overall Semantic Perception and “Repeated Questioning” Learning Mechanism

  • Yuanjun Tan,
  • Quanling Liu,
  • Tingting Liu,
  • Hai Liu,
  • Shengming Wang,
  • Zengzhao Chen

DOI: https://doi.org/10.3390/app14104259
Journal volume & issue: Vol. 14, No. 10, p. 4259

Abstract

Pre-trained language models based on the Transformer possess exceptional general text-understanding capabilities, enabling them to adeptly handle a variety of tasks. However, their topic classification ability is seriously degraded when faced with long colloquial texts, expressions that share similar semantics but differ completely in wording, and textual errors introduced by imperfect speech recognition. We propose a long-text topic classification method called RQ-OSPTrans to effectively address these challenges. To this end, two parallel modules are proposed for learning long texts: the repeat question module and the overall semantic perception module. The overall semantic perception module applies average pooling to the semantic embeddings produced by BERT and then learns from the pooled representation with a multi-layer perceptron. The repeat question module learns the text-embedding matrix, treating words as the fundamental elements and extracting fine-grained clues for classification. Comprehensive experiments demonstrate that RQ-OSPTrans achieves a generalization performance of 98.5% on the Chinese dataset THUCNews. Moreover, RQ-OSPTrans achieves state-of-the-art performance on the arXiv-10 dataset (84.4%) and performs comparably with other state-of-the-art pre-trained models on the AG's News dataset. Finally, we validate RQ-OSPTrans in a specific task scenario using our custom-built dataset, CCIPC; the results indicate that our method outperforms the baseline methods on small-scale domain-specific datasets.
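The abstract describes two parallel branches operating on BERT token embeddings: an overall semantic perception branch (average pooling followed by an MLP) and a word-level repeat question branch. The following is a minimal PyTorch sketch of that structure; the layer sizes, the modelling of "repeated questioning" as repeated encoder passes over the token-embedding matrix, and the logit-averaging fusion are assumptions for illustration, not the authors' implementation.

    # Minimal sketch of the two parallel branches named in the abstract.
    # All hyperparameters and the fusion rule are assumptions, not the
    # published RQ-OSPTrans code.
    import torch
    import torch.nn as nn

    class OverallSemanticPerception(nn.Module):
        """Average-pool BERT token embeddings, then apply an MLP (per the abstract)."""
        def __init__(self, hidden=768, mlp_dim=256, num_classes=10):
            super().__init__()
            self.mlp = nn.Sequential(
                nn.Linear(hidden, mlp_dim), nn.ReLU(),
                nn.Linear(mlp_dim, num_classes),
            )

        def forward(self, token_emb, mask):
            # token_emb: (B, L, H) from BERT; mask: (B, L), 1 for real tokens
            summed = (token_emb * mask.unsqueeze(-1)).sum(dim=1)
            pooled = summed / mask.sum(dim=1, keepdim=True).clamp(min=1)
            return self.mlp(pooled)

    class RepeatQuestion(nn.Module):
        """Re-reads the token-embedding matrix for word-level clues.
        'Repeated questioning' is sketched here as repeated encoder passes (assumed)."""
        def __init__(self, hidden=768, rounds=3, num_classes=10):
            super().__init__()
            layer = nn.TransformerEncoderLayer(d_model=hidden, nhead=8, batch_first=True)
            self.encoder = nn.TransformerEncoder(layer, num_layers=1)
            self.rounds = rounds
            self.cls = nn.Linear(hidden, num_classes)

        def forward(self, token_emb, mask):
            x = token_emb
            for _ in range(self.rounds):  # repeated "questioning" passes (assumed)
                x = self.encoder(x, src_key_padding_mask=~mask.bool())
            return self.cls(x[:, 0])      # classify from the [CLS] position

    # Fuse the two parallel branches, e.g. by averaging their logits (assumed).
    B, L, H = 2, 128, 768
    emb = torch.randn(B, L, H)            # stand-in for BERT output
    mask = torch.ones(B, L)
    logits = 0.5 * (OverallSemanticPerception()(emb, mask) + RepeatQuestion()(emb, mask))
    print(logits.shape)                   # torch.Size([2, 10])

In this sketch the two branches are independent and fused only at the logit level, mirroring the "parallel learning modules" phrasing; how the paper actually combines the branches would need to be confirmed against the full text.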
