Optimizing Integrated Features for Hindi Automatic Speech Recognition System

Dua Mohit; Aggarwal Rajesh Kumar; Biswas Mantosh

doi:10.1515/jisys-2018-0057

Journal of Intelligent Systems (Oct 2018)

Affiliations

Dua Mohit: Department of Computer Engineering, National Institute of Technology, Kurukshetra 136119, India
Aggarwal Rajesh Kumar: Department of Computer Engineering, National Institute of Technology, Kurukshetra 136119, India
Biswas Mantosh: Department of Computer Engineering, National Institute of Technology, Kurukshetra 136119, India

DOI: https://doi.org/10.1515/jisys-2018-0057
Journal volume & issue: Vol. 29, no. 1
pp. 959 – 976

Abstract

An automatic speech recognition (ASR) system translates spoken words or utterances (isolated, connected, continuous, and spontaneous) into text. State-of-the-art ASR systems mainly use Mel frequency (MF) cepstral coefficients (MFCC), perceptual linear prediction (PLP), and Gammatone frequency (GF) cepstral coefficients (GFCC) to extract features in the training phase. The paper first proposes sequential combinations of the three feature extraction methods, taken two at a time. Six combinations, MF-PLP, PLP-MFCC, MF-GFCC, GF-MFCC, GF-PLP, and PLP-GFCC, are used, and the accuracy of the proposed system is tested with each. The results show that the GF-MFCC and MF-GFCC integrations outperform all other proposed integrations. These two feature vector integrations are then optimized using three optimization methods: particle swarm optimization (PSO), PSO with crossover, and PSO with quadratic crossover (Q-PSO). The results demonstrate that the Q-PSO-optimized GF-MFCC integration shows significant improvement over all other optimized combinations.
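
For readers unfamiliar with PSO, the following is a minimal sketch of the plain (textbook) particle swarm update in Python. It is not the authors' implementation and does not include the crossover or quadratic crossover variants named in the abstract. The function name pso_minimize, the parameter values, and the toy objective sphere are assumptions chosen for illustration; in the paper's setting the objective would instead be tied to recognition performance on the integrated feature vectors.

```python
import random


def sphere(x):
    """Toy objective (hypothetical): sum of squares, minimum 0 at the origin."""
    return sum(v * v for v in x)


def pso_minimize(objective, dim, n_particles=20, n_iters=100,
                 inertia=0.7, c1=1.5, c2=1.5, bounds=(-1.0, 1.0), seed=0):
    """Plain PSO sketch; all default parameter values are illustrative assumptions."""
    rng = random.Random(seed)
    lo, hi = bounds

    # Random initial positions, zero initial velocities.
    pos = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]

    # Personal bests start at the initial positions.
    pbest = [p[:] for p in pos]
    pbest_val = [objective(p) for p in pos]

    # Global best is the best of the personal bests.
    g = min(range(n_particles), key=lambda i: pbest_val[i])
    gbest, gbest_val = pbest[g][:], pbest_val[g]

    for _ in range(n_iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                # Velocity: inertia term + pull toward personal and global bests.
                vel[i][d] = (inertia * vel[i][d]
                             + c1 * r1 * (pbest[i][d] - pos[i][d])
                             + c2 * r2 * (gbest[d] - pos[i][d]))
                # Move the particle and clamp it to the search bounds.
                pos[i][d] = min(hi, max(lo, pos[i][d] + vel[i][d]))

            val = objective(pos[i])
            if val < pbest_val[i]:
                pbest[i], pbest_val[i] = pos[i][:], val
                if val < gbest_val:
                    gbest, gbest_val = pos[i][:], val

    return gbest, gbest_val
```

Calling pso_minimize(sphere, dim=3) returns the best position found and its objective value. The fixed seed only makes the sketch reproducible.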

Published in Journal of Intelligent Systems

ISSN: 0334-1860 (Print); 2191-026X (Online)
Publisher: De Gruyter
Country of publisher: Poland
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.degruyter.com/view/journals/jisys/jisys-overview.xml
