A Small Brazilian Portuguese Speech Corpus for Speaker Recognition Study

Alberto Yoshihiro Nakano; Hélio Rodrigues da Silva; Julian Rodrigues Dourado; Felipe Walter Dafico Pfrimer

doi:10.5433/1679-0375.2024.v45.50518

Semina: Ciências Exatas e Tecnológicas (Jun 2024)

Affiliations

Alberto Yoshihiro Nakano: Universidade Tecnológica Federal do Paraná
Hélio Rodrigues da Silva: Universidade Tecnológica Federal do Paraná
Julian Rodrigues Dourado: Universidade Tecnológica Federal do Paraná
Felipe Walter Dafico Pfrimer: Universidade Tecnológica Federal do Paraná

Journal volume & issue: Vol. 45

Abstract

A small Brazilian Portuguese speech corpus was created for educational purposes to study a state-of-the-art speaker recognition system. The system models each speaker with a Gaussian Mixture Model (GMM) and uses Mel-frequency cepstral coefficients (MFCC) as acoustic features. Results on clean and noisy speech match expectations: the larger the mismatch between training and test conditions, the worse the performance. Performance also improves as utterance length increases. The results can serve as baselines for comparison with other statistical speaker models built from different acoustic features under different acoustic conditions.
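
As an illustration of the kind of system the abstract describes, the following is a minimal sketch of GMM-based speaker identification scoring. It is not the authors' implementation: the function names, the two hypothetical speaker models, and the 2-D feature vectors are assumptions made for illustration. In a real system the frames would be MFCC vectors extracted from speech, and the GMM parameters would be estimated from training data (typically with the EM algorithm) rather than written by hand.

```python
import math

def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at feature vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )

def gmm_log_likelihood(x, gmm):
    """Log p(x | speaker) for a GMM given as a list of (weight, mean, var)."""
    terms = [math.log(w) + log_gauss_diag(x, m, v) for w, m, v in gmm]
    peak = max(terms)  # log-sum-exp trick for numerical stability
    return peak + math.log(sum(math.exp(t - peak) for t in terms))

def average_score(frames, gmm):
    """Average per-frame log-likelihood of an utterance under one speaker model."""
    return sum(gmm_log_likelihood(x, gmm) for x in frames) / len(frames)

def identify(frames, models):
    """Return the speaker whose GMM gives the highest average log-likelihood."""
    return max(models, key=lambda name: average_score(frames, models[name]))

# Hypothetical two-component models over 2-D features (assumption: real MFCC
# vectors have more dimensions and the parameters come from training).
models = {
    "speaker_a": [(0.5, [0.0, 0.0], [1.0, 1.0]), (0.5, [1.0, 1.0], [1.0, 1.0])],
    "speaker_b": [(0.5, [4.0, 4.0], [1.0, 1.0]), (0.5, [5.0, 5.0], [1.0, 1.0])],
}

# Stand-in for the feature frames of one test utterance.
test_frames = [[0.2, -0.1], [0.9, 1.1], [0.4, 0.6]]
best = identify(test_frames, models)
```

Averaging the log-likelihood over frames is one common way to score an utterance. Under this scheme a longer utterance contributes more frames to the average, which is in line with the abstract's observation that results improve with utterance length.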

Published in Semina: Ciências Exatas e Tecnológicas

ISSN: 1676-5451 (Print); 1679-0375 (Online)
Publisher: Universidade Estadual de Londrina
Country of publisher: Brazil
LCC subjects: Technology: Technology (General); Science: Science (General)
Website: http://www.uel.br/revistas/uel/index.php/semexatas/index
