SoftwareX (Jun 2021)

ASVtorch toolkit: Speaker verification with deep neural networks

  • Kong Aik Lee,
  • Ville Vestman,
  • Tomi Kinnunen

Journal volume & issue
Vol. 14
p. 100697

Abstract

Read online

The human voice differs substantially between individuals. This facilitates automatic speaker verification (ASV) — recognizing a person from his/her voice. ASV accuracy has substantially increased throughout the past decade due to recent advances in machine learning, particularly deep learning methods. An unfortunate downside has been substantially increased complexity of ASV systems. To help non-experts to kick-start reproducible ASV development, a state-of-the-art toolkit implementing various ASV pipelines and functionalities is required. To this end, we introduce a new open-source toolkit, ASVtorch, implemented in Python using the widely used PyTorch machine learning framework.

Keywords