Semi-Supervised Speech Recognition Acoustic Model Training Using Policy Gradient

Hoon Chung; Sung Joo Lee; Hyeong Bae Jeon; Jeon Gue Park

doi:10.3390/app10103542

Applied Sciences (May 2020)

Semi-Supervised Speech Recognition Acoustic Model Training Using Policy Gradient

Hoon Chung,
Sung Joo Lee,
Hyeong Bae Jeon,
Jeon Gue Park

Affiliations

Hoon Chung: Electronics and Telecommunications Research Institute, Daejeon 34129, Korea
Sung Joo Lee: Electronics and Telecommunications Research Institute, Daejeon 34129, Korea
Hyeong Bae Jeon: Electronics and Telecommunications Research Institute, Daejeon 34129, Korea
Jeon Gue Park: Electronics and Telecommunications Research Institute, Daejeon 34129, Korea

DOI: https://doi.org/10.3390/app10103542
Journal volume & issue: Vol. 10, no. 10
p. 3542

Abstract

Read online

In this paper, we propose a policy gradient-based semi-supervised speech recognition acoustic model training. In practice, self-training and teacher/student learning are one of the widely used semi-supervised training methods due to their scalability and effectiveness. These methods are based on generating pseudo labels for unlabeled samples using a pre-trained model and selecting reliable samples using confidence measure. However, there are some considerations in this approach. The generated pseudo labels can be biased depending on which pre-trained model is used, and the training process can be complicated because the confidence measure is usually carried out in post-processing using external knowledge. Therefore, to address these issues, we propose a policy gradient method-based approach. Policy gradient is a reinforcement learning algorithm to find an optimal behavior strategy for an agent to obtain optimal rewards. The policy gradient-based approach provides a framework for exploring unlabeled data as well as exploiting labeled data, and it also provides a way to incorporate external knowledge in the same training cycle. The proposed approach was evaluated on an in-house non-native Korean recognition domain. The experimental results show that the method is effective in semi-supervised acoustic model training.

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci

About the journal

Abstract

Keywords