MATEC Web of Conferences (Jan 2021)

Selection of acoustic modeling unit for Tibetan speech recognition based on deep learning

  • Gong Baojia,
  • Cai Rangzhuoma,
  • Cai Zhijie,
  • Ding Yuntao,
  • Peng Maozhaxi

DOI
https://doi.org/10.1051/matecconf/202133606014
Journal volume & issue
Vol. 336
p. 06014

Abstract

Read online

The selection of the speech recognition modeling unit is the primary problem of acoustic modeling in speech recognition, and different acoustic modeling units will directly affect the overall performance of speech recognition. This paper designs the Tibetan character segmentation and labeling model and algorithm flow for the purpose of solving the problem of selecting the acoustic modeling unit in Tibetan speech recognition by studying and analyzing the deficiencies of the existing acoustic modeling units in Tibetan speech recognition. After experimental verification, the Tibetan character segmentation and labeling model and algorithm achieved good performance of character segmentation and labeling, and the accuracy of Tibetan character segmentation and labeling reached 99.98%, respectively.