A study of transformer-based end-to-end speech recognition system for Kazakh language

Mamyrbayev Orken; Oralbekova Dina; Alimhan Keylan; Turdalykyzy Tolganay; Othman Mohamed

doi:10.1038/s41598-022-12260-y

Scientific Reports (May 2022)

Affiliations

Mamyrbayev Orken: Institute of Information and Computational Technologies CS MES RK
Oralbekova Dina: Institute of Information and Computational Technologies CS MES RK
Alimhan Keylan: Institute of Information and Computational Technologies CS MES RK
Turdalykyzy Tolganay: Institute of Information and Computational Technologies CS MES RK
Othman Mohamed: Universiti Putra Malaysia

Journal volume & issue: Vol. 12, no. 1, pp. 1–11

Abstract

Today, the Transformer model, which allows parallelization and relies on self-attention, is widely used in speech recognition. The main advantages of this architecture are its fast training and the absence of the sequential computation required by recurrent neural networks. In this work, Transformer models and an end-to-end model based on connectionist temporal classification (CTC) were considered for building an automatic Kazakh speech recognition system. Kazakh is an agglutinative language with limited data available for building speech recognition systems. Some studies have shown that the Transformer model improves system performance for low-resource languages. Our experiments showed that the joint use of Transformer and CTC models improved the performance of the Kazakh speech recognition system; with an integrated language model, it achieved its best character error rate of 3.7% on a clean dataset.
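
The character error rate reported above is conventionally defined as the character-level edit distance between the recognized text and the reference transcript, divided by the reference length. The abstract does not state how the authors computed it, so the following is only a minimal sketch under that standard definition; the function names `edit_distance` and `character_error_rate` are illustrative and do not come from the paper.

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance between two strings, counted in characters."""
    # prev[j] holds the distance between the processed prefix of ref
    # and the first j characters of hyp.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]  # turning i reference characters into an empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # reference character missing from hyp
                curr[j - 1] + 1,     # extra character in hyp
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1]


def character_error_rate(ref: str, hyp: str) -> float:
    """Edit distance normalized by reference length (assumed definition)."""
    if not ref:
        raise ValueError("reference transcript must not be empty")
    return edit_distance(ref, hyp) / len(ref)


if __name__ == "__main__":
    # One substituted character in a five-character Kazakh word.
    print(character_error_rate("сәлем", "салем"))
```

Under this definition, a rate of 3.7% corresponds to roughly 37 character errors per 1,000 reference characters. Whether spaces and punctuation count as characters is a scoring choice that the abstract does not specify.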

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/
