Self-Supervised Open-Set Speaker Recognition with Laguerre–Voronoi Descriptors

Abu Quwsar Ohi; Marina L. Gavrilova

doi:10.3390/s24061996

Sensors (Mar 2024)

Self-Supervised Open-Set Speaker Recognition with Laguerre–Voronoi Descriptors

Abu Quwsar Ohi,
Marina L. Gavrilova

Affiliations

Abu Quwsar Ohi: Department of Computer Science, University of Calgary, Calgary, AB T2N1N4, Canada
Marina L. Gavrilova: Department of Computer Science, University of Calgary, Calgary, AB T2N1N4, Canada

DOI: https://doi.org/10.3390/s24061996
Journal volume & issue: Vol. 24, no. 6
p. 1996

Abstract

Read online

Speaker recognition is a challenging problem in behavioral biometrics that has been rigorously investigated over the last decade. Although numerous supervised closed-set systems inherit the power of deep neural networks, limited studies have been made on open-set speaker recognition. This paper proposes a self-supervised open-set speaker recognition that leverages the geometric properties of speaker distribution for accurate and robust speaker verification. The proposed framework consists of a deep neural network incorporating a wider viewpoint of temporal speech features and Laguerre–Voronoi diagram-based speech feature extraction. The deep neural network is trained with a specialized clustering criterion that only requires positive pairs during training. The experiments validated that the proposed system outperformed current state-of-the-art methods in open-set speaker recognition and cluster representation.

Published in Sensors

ISSN: 1424-8220 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Chemical technology
Website: http://www.mdpi.com/journal/sensors

About the journal

Abstract

Keywords