ASVtorch toolkit: Speaker verification with deep neural networks

Kong Aik Lee; Ville Vestman; Tomi Kinnunen

SoftwareX (Jun 2021)

ASVtorch toolkit: Speaker verification with deep neural networks

Kong Aik Lee,
Ville Vestman,
Tomi Kinnunen

Affiliations

Kong Aik Lee: Institute for Infocomm Research, A*STAR, Singapore
Ville Vestman: Computational Speech Group, University of Eastern Finland, Finland
Tomi Kinnunen: Computational Speech Group, University of Eastern Finland, Finland; Corresponding author.

Journal volume & issue: Vol. 14
p. 100697

Abstract

Read online

The human voice differs substantially between individuals. This facilitates automatic speaker verification (ASV) — recognizing a person from his/her voice. ASV accuracy has substantially increased throughout the past decade due to recent advances in machine learning, particularly deep learning methods. An unfortunate downside has been substantially increased complexity of ASV systems. To help non-experts to kick-start reproducible ASV development, a state-of-the-art toolkit implementing various ASV pipelines and functionalities is required. To this end, we introduce a new open-source toolkit, ASVtorch, implemented in Python using the widely used PyTorch machine learning framework.

Published in SoftwareX

ISSN: 2352-7110 (Online)
Publisher: Elsevier
Country of publisher: Netherlands
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science: Computer software
Website: http://www.journals.elsevier.com/softwarex/

About the journal

Abstract

Keywords