EURASIP Journal on Advances in Signal Processing (Jan 2010)

Histogram Equalization to Model Adaptation for Robust Speech Recognition

  • Suh Youngjoo,
  • Kim Hoirin

Journal volume & issue
Vol. 2010, no. 1
p. 628018

Abstract

Read online

We propose a new model adaptation method based on the histogram equalization technique for providing robustness in noisy environments. The trained acoustic mean models of a speech recognizer are adapted into environmentally matched conditions by using the histogram equalization algorithm on a single utterance basis. For more robust speech recognition in the heavily noisy conditions, trained acoustic covariance models are efficiently adapted by the signal-to-noise ratio-dependent linear interpolation between trained covariance models and utterance-level sample covariance models. Speech recognition experiments on both the digit-based Aurora2 task and the large vocabulary-based task showed that the proposed model adaptation approach provides significant performance improvements compared to the baseline speech recognizer trained on the clean speech data.