Singing Voice Detection: A Survey

Ramy Monir; Daniel Kostrzewa; Dariusz Mrozek

doi:10.3390/e24010114

Entropy (Jan 2022)

Singing Voice Detection: A Survey

Ramy Monir,
Daniel Kostrzewa,
Dariusz Mrozek

Affiliations

Ramy Monir: Department of Applied Informatics, Silesian University of Technology, 44-100 Gliwice, Poland
Daniel Kostrzewa: Department of Applied Informatics, Silesian University of Technology, 44-100 Gliwice, Poland
Dariusz Mrozek: Department of Applied Informatics, Silesian University of Technology, 44-100 Gliwice, Poland

DOI: https://doi.org/10.3390/e24010114
Journal volume & issue: Vol. 24, no. 1
p. 114

Abstract

Read online

Singing voice detection or vocal detection is a classification task that determines whether there is a singing voice in a given audio segment. This process is a crucial preprocessing step that can be used to improve the performance of other tasks such as automatic lyrics alignment, singing melody transcription, singing voice separation, vocal melody extraction, and many more. This paper presents a survey on the techniques of singing voice detection with a deep focus on state-of-the-art algorithms such as convolutional LSTM and GRU-RNN. It illustrates a comparison between existing methods for singing voice detection, mainly based on the Jamendo and RWC datasets. Long-term recurrent convolutional networks have reached impressive results on public datasets. The main goal of the present paper is to investigate both classical and state-of-the-art approaches to singing voice detection.

Published in Entropy

ISSN: 1099-4300 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Astronomy: Astrophysics; Science: Physics
Website: http://www.mdpi.com/journal/entropy

About the journal

Abstract

Keywords