Journal of Engineering and Applied Science (Jan 2024)

An investigation into the reliability of speaker recognition schemes: analysing the impact of environmental factors utilising deep learning techniques

  • Omar Ratib Khazaleh,
  • Leen Ahmed Khrais

DOI
https://doi.org/10.1186/s44147-023-00351-0
Journal volume & issue
Vol. 71, no. 1
pp. 1 – 24

Abstract

Read online

Abstract This paper studies the performance and reliability of deep learning-based speaker recognition schemes under various recording situations and background noise presence. The study uses the Speaker Recognition Dataset offered in the Kaggle website, involving audio recordings from different speakers, and four scenarios with various combinations of speakers. In the first scenario, the scheme achieves discriminating capability and high accuracy in identifying speakers without taking into account outside noise, having roughly one area under the ROC curve. Nevertheless, in the second scenario, with background noise added to the recording, accuracy decreases, and misclassifications increase. However, the scheme still reveals good discriminating power, with ROC areas ranging from 0.77 to 1.

Keywords