工程科学学报 (Sep 2015)

Continuous speech recognition by convolutional neural networks

  • ZHANG Qing-qing,
  • LIU Yong,
  • PAN Jie-lin,
  • YAN Yong-hong

DOI
https://doi.org/10.13374/j.issn2095-9389.2015.09.015
Journal volume & issue
Vol. 37, no. 9
pp. 1212 – 1217

Abstract

Read online

Convolutional neural networks (CNNs), which show success in achieving translation invariance for many image processing tasks, were investigated for continuous speech recognition. Compared to deep neural networks (DNNs), which are proven to be successful in many speech recognition tasks nowadays, CNNs can reduce the neural network model sizes significantly, and at the same time achieve even a better recognition accuracy. Experiments on standard speech corpus TIMIT and conversational speech corpus show that CNNs outperform DNNs in terms of the accuracy and the generalization ability.

Keywords