A Preliminary Study of Robust Speech Feature Extraction Based on Maximizing the Probability of States in Deep Acoustic Models

Li-Chia Chang; Jeih-Weih Hung

doi:10.3390/asi5040071

Applied System Innovation (Jul 2022)


Affiliations

Li-Chia Chang: Department of Electrical Engineering, National Chi Nan University, Nantou 545, Taiwan
Jeih-Weih Hung: Department of Electrical Engineering, National Chi Nan University, Nantou 545, Taiwan

DOI: https://doi.org/10.3390/asi5040071
Journal volume & issue: Vol. 5, no. 4, p. 71

Abstract

This study proposes a novel robust speech feature extraction technique to improve speech recognition performance in noisy environments. The method exploits the information provided by the original acoustic model in the automatic speech recognition (ASR) system to learn a deep neural network that converts the original speech features. This deep neural network is trained to maximize the posterior accuracy of the state sequences of the acoustic models with respect to the speech feature sequences. Compared with robustness methods that retrain or adapt the acoustic models, the new method has the advantages of a lower computational load and faster training. In experiments conducted on the medium-vocabulary TIMIT database and task, the presented method provides lower word error rates than the unprocessed baseline and speech-enhancement-based techniques. These results indicate that the presented method is promising and merits further development.
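The training idea summarized in the abstract can be illustrated with a toy sketch: the acoustic model stays frozen, and only a feature transform placed in front of it is updated so that the model assigns higher posterior probability to the reference states. Everything below is an assumption made for illustration and is not taken from the paper: the "acoustic model" is a single linear-softmax layer, the front-end is a residual affine map rather than a deep network, the data are synthetic 2-D points with a constant offset standing in for noise, and all names (`state_posteriors`, `transform`, `train_front_end`) are hypothetical.

```python
import math
import random

def softmax(z):
    m = max(z)
    e = [math.exp(v - m) for v in z]
    total = sum(e)
    return [v / total for v in e]

def state_posteriors(W, b, y):
    """Frozen stand-in acoustic model: softmax over linear state scores."""
    return softmax([sum(w * v for w, v in zip(row, y)) + bk
                    for row, bk in zip(W, b)])

def transform(A, c, x):
    """Trainable front-end (assumed form): residual affine map y = x + A x + c."""
    return [xi + sum(a * v for a, v in zip(row, x)) + ci
            for xi, row, ci in zip(x, A, c)]

def mean_nll(W, b, A, c, data):
    """Mean negative log posterior of the reference state."""
    return -sum(math.log(state_posteriors(W, b, transform(A, c, x))[s])
                for x, s in data) / len(data)

def train_front_end(W, b, data, dim, epochs=300, lr=0.02):
    """Gradient descent on A and c only; W and b are never modified."""
    A = [[0.0] * dim for _ in range(dim)]
    c = [0.0] * dim
    n = len(data)
    for _ in range(epochs):
        gA = [[0.0] * dim for _ in range(dim)]
        gc = [0.0] * dim
        for x, s in data:
            p = state_posteriors(W, b, transform(A, c, x))
            err = [pk - (1.0 if k == s else 0.0) for k, pk in enumerate(p)]
            # Back-propagate through the frozen model: dL/dy = W^T (p - onehot)
            gy = [sum(W[k][i] * err[k] for k in range(len(W)))
                  for i in range(dim)]
            for i in range(dim):
                gc[i] += gy[i]
                for j in range(dim):
                    gA[i][j] += gy[i] * x[j]
        for i in range(dim):
            c[i] -= lr * gc[i] / n
            for j in range(dim):
                A[i][j] -= lr * gA[i][j] / n
    return A, c

# Synthetic setup (assumed): 3 states, 2-D features, constant "noise" offset.
random.seed(0)
DIM, STATES = 2, 3
W = [[2.0, 0.0], [-1.0, 1.7], [-1.0, -1.7]]   # frozen model weights
b = [0.0, 0.0, 0.0]                            # frozen model biases
means = [[1.0, 0.0], [-0.5, 0.87], [-0.5, -0.87]]
noise_offset = [0.8, -0.6]
data = []
for _ in range(60):
    s = random.randrange(STATES)
    x = [m + random.gauss(0.0, 0.2) + o
         for m, o in zip(means[s], noise_offset)]
    data.append((x, s))

identity_A = [[0.0] * DIM for _ in range(DIM)]
loss_before = mean_nll(W, b, identity_A, [0.0] * DIM, data)
A, c = train_front_end(W, b, data, DIM)
loss_after = mean_nll(W, b, A, c, data)
print("mean NLL before:", loss_before, "after:", loss_after)
```

Because only `A` and `c` are updated, the sketch reflects the property the abstract emphasizes: the acoustic model itself is neither retrained nor adapted. A real system would replace the linear pieces with deep networks and train on state alignments from the ASR system.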

Published in Applied System Innovation

ISSN: 2571-5577 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Applied mathematics. Quantitative methods
Website: https://www.mdpi.com/journal/asi
