International Journal of Mathematical, Engineering and Management Sciences (Feb 2024)

Statistically Significant Duration-Independent-based Noise-Robust Speaker Verification

  • Asmita Nirmal,
  • Deepak Jayaswal,
  • Pramod H. Kachare

DOI
https://doi.org/10.33889/IJMEMS.2024.9.1.008
Journal volume & issue
Vol. 9, no. 1
pp. 147 – 162

Abstract

Read online

A speaker verification system models individual speakers using different speech features to improve their robustness. However, redundant features degrade the system's performance. This paper presents Statistically Significant Duration-Independent Mel frequency Cepstral Coefficients (SSDI-MFCC) features with the Extreme Gradient Boost classifier for improving the noise-robustness of speaker models. Eight statistical descriptors are used to generate signal duration-independent features, and a statistically significant feature subset is obtained using a t-test. A redeveloped Librispeech database by adding noises from the AURORA database to simulate real-world test conditions for speaker verification is used for evaluation. The SSDI-MFCC is compared with Principal Component Analysis (PCA) and Genetic Algorithm (GA). The comparative results showed average equal error rate improvements by 4.93 % and 3.48 % with the SSDI-MFCC than GA-MFCC and PCA-MFCC in clean and noisy conditions, respectively. A significant reduction in verification time is observed using SSDI-MFCC than the complete feature set.

Keywords