ETRI Journal (Feb 2024)

Transformer-based reranking for improving Korean morphological analysis systems

  • Jihee Ryu,
  • Soojong Lim,
  • Oh-Woog Kwon,
  • Seung-Hoon Na

DOI
https://doi.org/10.4218/etrij.2023-0364
Journal volume & issue
Vol. 46, no. 1
pp. 137 – 153

Abstract

Read online

This study introduces a new approach in Korean morphological analysis combining dictionary-based techniques with Transformer-based deep learning models. The key innovation is the use of a BERT-based reranking system, significantly enhancing the accuracy of traditional morphological analysis. The method generates multiple suboptimal paths, then employs BERT models for reranking, leveraging their advanced language comprehension. Results show remarkable performance improvements, with the first-stage reranking achieving over 20% improvement in error reduction rate compared with existing models. The second stage, using another BERT variant, further increases this improvement to over 30%. This indicates a significant leap in accuracy, validating the effectiveness of merging dictionary-based analysis with contemporary deep learning. The study suggests future exploration in refined integrations of dictionary and deep learning methods as well as using probabilistic models for enhanced morphological analysis. This hybrid approach sets a new benchmark in the field and offers insights for similar challenges in language processing applications.

Keywords