MATEC Web of Conferences (Jan 2017)

Chinese-Lao Bilingual Named Entity Alignment Research

  • Han Rui,
  • Zhou Lanjiang,
  • Zhou Feng,
  • Zhang Jinpeng

DOI
https://doi.org/10.1051/matecconf/201710002052
Journal volume & issue
Vol. 100
p. 02052

Abstract

Read online

Chinese-Lao bilingual NE alignment has a very important significance. Three entity alignment methods are proposed in this paper. Firstly, the paper proposes the similarity of bilingual entity fuzzy matching problem. Secondly, we use bilingual entity word sequence pattern similarity to propose Chinese entity model to match Lao entity method. Then we build a naïve Bayes bilingual NE alignment model to align Chinese and Lao named entity in the comparable corpus, by mining knowledge information words of Chinese entities. In the end, the rules combine the advantages of the three methods are proposed to achieve the best results.

Keywords