Scientific Reports (Mar 2024)

Application of the transformer model algorithm in Chinese word sense disambiguation: a case study in the Chinese language

  • Linlin Li,
  • Juxing Li,
  • Hongli Wang,
  • Jianing Nie

DOI: https://doi.org/10.1038/s41598-024-56976-5
Journal volume & issue: Vol. 14, no. 1, pp. 1–15

Abstract


This study explores the application of the Transformer model algorithm to Chinese word sense disambiguation, seeking to resolve word sense ambiguity in the Chinese language. It introduces deep learning and designs a Chinese word sense disambiguation model based on the fusion of the Transformer with the Bidirectional Long Short-Term Memory (BiLSTM) algorithm. By utilizing the self-attention mechanism of the Transformer and the sequence modeling capability of BiLSTM, the model efficiently captures semantic information and contextual relationships in Chinese sentences, leading to accurate word sense disambiguation. The model is evaluated on the PKU Paraphrase Bank, a Chinese text paraphrase dataset. The results demonstrate that the model achieves a precision of 83.71% in Chinese word sense disambiguation, significantly outperforming the Long Short-Term Memory algorithm. Additionally, the root mean squared error of the algorithm is less than 17, with the loss function value remaining around 0.14. This study thus validates that the constructed Transformer-fused BiLSTM model exhibits both high accuracy and robustness in identifying word senses in the Chinese language. The findings provide valuable insights for advancing intelligent word sense processing in Chinese language applications.
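The abstract describes the architecture only at a high level: a Transformer encoder (self-attention over the whole sentence) whose output is fed through a BiLSTM (left-to-right and right-to-left sequence modeling) before sense classification. The minimal PyTorch sketch below illustrates one plausible reading of that fusion; all hyperparameters, the per-token classification head, and the class name are illustrative assumptions, not details taken from the paper.

```python
# A minimal sketch (not the authors' code) of a Transformer encoder
# fused with a BiLSTM for word sense disambiguation. Assumptions:
# integer token IDs as input, a fixed sense inventory of `num_senses`,
# and a sense prediction at every token position.
import torch
import torch.nn as nn

class TransformerBiLSTMWSD(nn.Module):
    def __init__(self, vocab_size, num_senses, d_model=256,
                 nhead=8, num_layers=2, lstm_hidden=128):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)
        # Transformer encoder: self-attention relates every token pair,
        # capturing sentence-level semantic information.
        encoder_layer = nn.TransformerEncoderLayer(
            d_model=d_model, nhead=nhead, batch_first=True)
        self.encoder = nn.TransformerEncoder(encoder_layer, num_layers)
        # BiLSTM: models ordered context in both directions on top of
        # the attention-contextualized representations.
        self.bilstm = nn.LSTM(d_model, lstm_hidden, batch_first=True,
                              bidirectional=True)
        # Per-token sense classifier over the fused representation.
        self.classifier = nn.Linear(2 * lstm_hidden, num_senses)

    def forward(self, token_ids):
        x = self.embed(token_ids)   # (batch, seq, d_model)
        x = self.encoder(x)         # contextualized by self-attention
        x, _ = self.bilstm(x)       # (batch, seq, 2 * lstm_hidden)
        return self.classifier(x)   # sense logits per token

# Usage: score each token of a dummy batch against 50 candidate senses.
model = TransformerBiLSTMWSD(vocab_size=10000, num_senses=50)
logits = model(torch.randint(0, 10000, (2, 20)))  # shape (2, 20, 50)
```

The sequential fusion shown here (attention first, recurrence second) is one common design; the paper's exact wiring, training objective, and hyperparameters are not specified in the abstract.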
