A Tutorial on Text-Independent Speaker Verification

Frédéric Bimbot; Jean-François Bonastre; Corinne Fredouille; Guillaume Gravier; Ivan Magrin-Chagnolleau; Sylvain Meignier; Teva Merlin; Javier Ortega-García; Dijana Petrovska-Delacrétaz; Douglas A. Reynolds

doi:10.1155/S1687617204310024

EURASIP Journal on Advances in Signal Processing (Apr 2004)

Journal volume & issue: Vol. 2004, no. 4, pp. 430–451

Abstract

This paper presents an overview of a state-of-the-art text-independent speaker verification system. First, an introduction proposes a modular scheme of the training and test phases of a speaker verification system. Then, the speech parameterization most commonly used in speaker verification, namely, cepstral analysis, is detailed. Gaussian mixture modeling, which is the speaker modeling technique used in most systems, is then explained. A few speaker modeling alternatives, namely, neural networks and support vector machines, are mentioned. Normalization of scores is then explained, as this is a very important step in dealing with real-world data. The evaluation of a speaker verification system is then detailed, and the detection error trade-off (DET) curve is explained. Several extensions of speaker verification are then enumerated, including speaker tracking and segmentation by speakers. Then, some applications of speaker verification are proposed, including on-site applications, remote applications, applications related to structuring audio information, and games. Issues concerning the forensic area are then recalled, as we believe it is very important to inform people about the actual performance and limitations of speaker verification systems. This paper concludes by giving a few research trends in speaker verification for the next couple of years.

Published in EURASIP Journal on Advances in Signal Processing

ISSN: 1687-6172 (Print); 1687-6180 (Online)
Publisher: SpringerOpen
Country of publisher: United Kingdom
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Telecommunication; Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics
Website: https://asp-eurasipjournals.springeropen.com
