Discriminatively trained continuous Hindi speech recognition using integrated acoustic features and recurrent neural network language modeling

Kumar A.; Aggarwal R.K.

doi:10.1515/jisys-2018-0417

Journal of Intelligent Systems (Jul 2020)

Discriminatively trained continuous Hindi speech recognition using integrated acoustic features and recurrent neural network language modeling

Kumar A.,
Aggarwal R.K.

Affiliations

Kumar A.: Computer Engineering Department, National Institute of Technology, Kurukshetra, Haryana, India
Aggarwal R.K.: Computer Engineering Department, National Institute of Technology, Kurukshetra, Haryana, India

DOI: https://doi.org/10.1515/jisys-2018-0417
Journal volume & issue: Vol. 30, no. 1
pp. 165 – 179

Abstract

Read online

This paper implements the continuous Hindi Automatic Speech Recognition (ASR) system using the proposed integrated features vector with Recurrent Neural Network (RNN) based Language Modeling (LM). The proposed system also implements the speaker adaptation using Maximum-Likelihood Linear Regression (MLLR) and Constrained Maximum likelihood Linear Regression (C-MLLR). This system is discriminatively trained by Maximum Mutual Information (MMI) and Minimum Phone Error (MPE) techniques with 256 Gaussian mixture per Hidden Markov Model(HMM) state. The training of the baseline system has been done using a phonetically rich Hindi dataset. The results show that discriminative training enhances the baseline system performance by up to 3%. Further improvement of ~7% has been recorded by applying RNN LM. The proposed Hindi ASR system shows significant performance improvement over other current state-of-the-art techniques.

Published in Journal of Intelligent Systems

ISSN: 0334-1860 (Print); 2191-026X (Online)
Publisher: De Gruyter
Country of publisher: Poland
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.degruyter.com/view/journals/jisys/jisys-overview.xml

About the journal

Abstract

Keywords