Phonetic Variation Modeling and a Language Model Adaptation for Korean English Code-Switching Speech Recognition

Damheo Lee; Donghyun Kim; Seung Yun; Sanghun Kim

doi:10.3390/app11062866

Applied Sciences (Mar 2021)

Phonetic Variation Modeling and a Language Model Adaptation for Korean English Code-Switching Speech Recognition

Damheo Lee,
Donghyun Kim,
Seung Yun,
Sanghun Kim

Affiliations

Damheo Lee: Software Development Department, IIR TECH Inc., Daejeon 34134, Korea
Donghyun Kim: Artificial Intelligence Research Laboratory, Electronics and Telecommunications Research Institute, Daejeon 34129, Korea
Seung Yun: Artificial Intelligence Research Laboratory, Electronics and Telecommunications Research Institute, Daejeon 34129, Korea
Sanghun Kim: Artificial Intelligence Research Laboratory, Electronics and Telecommunications Research Institute, Daejeon 34129, Korea

DOI: https://doi.org/10.3390/app11062866
Journal volume & issue: Vol. 11, no. 6
p. 2866

Abstract

Read online

In this paper, we propose a new method for code-switching (CS) automatic speech recognition (ASR) in Korean. First, the phonetic variations in English pronunciation spoken by Korean speakers should be considered. Thus, we tried to find a unified pronunciation model based on phonetic knowledge and deep learning. Second, we extracted the CS sentences semantically similar to the target domain and then applied the language model (LM) adaptation to solve the biased modeling toward Korean due to the imbalanced training data. In this experiment, training data were AI Hub (1033 h) in Korean and Librispeech (960 h) in English. As a result, when compared to the baseline, the proposed method improved the error reduction rate (ERR) by up to 11.6% with phonetic variant modeling and by 17.3% when semantically similar sentences were applied to the LM adaptation. If we considered only English words, the word correction rate improved up to 24.2% compared to that of the baseline. The proposed method seems to be very effective in CS speech recognition.

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci

About the journal

Abstract

Keywords