PeerJ Computer Science (Dec 2023)

Voice spoofing detection using a neural networks assembly considering spectrograms and mel frequency cepstral coefficients

  • Carlos Alberto Hernández-Nava,
  • Eric Alfredo Rincón-García,
  • Pedro Lara-Velázquez,
  • Sergio Gerardo de-los-Cobos-Silva,
  • Miguel Angel Gutiérrez-Andrade,
  • Roman Anselmo Mora-Gutiérrez

DOI
https://doi.org/10.7717/peerj-cs.1740
Journal volume & issue
Vol. 9
p. e1740

Abstract

Read online Read online

Nowadays, biometric authentication has gained relevance due to the technological advances that have allowed its inclusion in many daily-use devices. However, this same advantage has also brought dangers, as spoofing attacks are now more common. This work addresses the vulnerabilities of automatic speaker verification authentication systems, which are prone to attacks arising from new techniques for the generation of spoofed audio. In this article, we present a countermeasure for these attacks using an approach that includes easy to implement feature extractors such as spectrograms and mel frequency cepstral coefficients, as well as a modular architecture based on deep neural networks. Finally, we evaluate our proposal using the well-know ASVspoof 2017 V2 database, the experiments show that using the final architecture the best performance is obtained, achieving an equal error rate of 6.66% on the evaluation set.

Keywords