Applied Sciences (Nov 2022)

A Binaural MFCC-CNN Sound Quality Model of High-Speed Train

  • Peilin Ruan,
  • Xu Zheng,
  • Yi Qiu,
  • Zhiyong Hao

DOI
https://doi.org/10.3390/app122312151
Journal volume & issue
Vol. 12, no. 23
p. 12151

Abstract

Read online

The high-speed train (HST) is one of the most important transport tools in China, and the sound quality of its interior noise affects passengers’ comfort. This paper proposes a HST sound quality model. The model combines Mel-scale frequency cepstral coefficients (MFCCs), the most popular spectral-based input parameter in deep learning models, with convolutional neural networks (CNNs) to evaluate the sound quality of HSTs. Meanwhile, two input channels are applied to simulate binaural hearing so that the different sound signals can be processed separately. The binaural MFCC-CNN model achieves an accuracy of 96.2% and outperforms the traditional shallow neural network model because it considers the time-varying characteristics of noise. The MFCC features are capable of capturing the characteristics of noise and improving the accuracy of sound quality evaluations. Besides, the results suggest that the time and level differences in sound signals are important factors affecting sound quality at low annoyance levels. The proposed model is expected to optimize the comfort of the interior acoustic environment of HSTs.

Keywords