Applied Sciences (Mar 2020)

Efficient Melody Extraction Based on an Extreme Learning Machine

  • Weiwei Zhang,
  • Qiaoling Zhang,
  • Sheng Bi,
  • Shaojun Fang,
  • Jinliang Dai

DOI
https://doi.org/10.3390/app10072213
Journal volume & issue
Vol. 10, no. 7
p. 2213

Abstract

Melody extraction is an important task in the music information retrieval community, and it remains unresolved because of the complex nature of real-world recordings. In this paper, the melody extraction problem is addressed in the extreme learning machine (ELM) framework. More specifically, the input musical signal is first pre-processed to mimic the human auditory system. The music features are then constructed by the constant-Q transform (CQT), and the concentration strategy is introduced to make use of contextual information. Afterwards, the rough melody pitches are determined by the ELM network according to its pre-trained parameters. Finally, the rough melody pitches are fine-tuned using the spectral peaks around the frame-wise rough pitches. The proposed method extracts melody from polyphonic music efficiently and effectively, with pitch estimation and voicing detection conducted jointly. Experiments were conducted on three publicly available datasets. The experimental results reveal that the proposed method achieves higher overall accuracy while running very fast.
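The classification step described in the abstract can be illustrated with a generic ELM, in which the hidden-layer weights are drawn at random and only the output weights are solved in closed form. The sketch below is not the authors' implementation. The names `stack_context` and `ELM`, the sigmoid activation, the ridge term `reg`, the reading of contextual information as stacking neighbouring frames, and the convention that class 0 means "unvoiced" are all assumptions made for illustration, and random vectors stand in for CQT features.

```python
import numpy as np

def stack_context(frames, width):
    """Join each frame with `width` neighbours on each side (edges repeated).

    Assumption: this is one plausible way to use contextual information.
    """
    padded = np.pad(frames, ((width, width), (0, 0)), mode="edge")
    n = frames.shape[0]
    return np.hstack([padded[i:i + n] for i in range(2 * width + 1)])

class ELM:
    """Generic single-hidden-layer ELM classifier (illustrative only)."""

    def __init__(self, n_hidden, reg=1e-3, seed=0):
        self.n_hidden = n_hidden
        self.reg = reg  # ridge term, assumed here for numerical stability
        self.rng = np.random.default_rng(seed)

    def _hidden(self, X):
        # Sigmoid activation of the random, untrained hidden layer.
        return 1.0 / (1.0 + np.exp(-(X @ self.W + self.b)))

    def fit(self, X, y, n_classes):
        # Hidden weights are random and stay fixed; only beta is learned.
        self.W = self.rng.normal(size=(X.shape[1], self.n_hidden))
        self.b = self.rng.normal(size=self.n_hidden)
        H = self._hidden(X)
        T = np.eye(n_classes)[y]  # one-hot targets
        A = H.T @ H + self.reg * np.eye(self.n_hidden)
        self.beta = np.linalg.solve(A, H.T @ T)  # regularised least squares
        return self

    def predict(self, X):
        return np.argmax(self._hidden(X) @ self.beta, axis=1)

# Toy usage: random vectors stand in for per-frame CQT features.
# Hypothetical labelling: class 0 = unvoiced, classes 1..2 = pitch bins,
# so one classifier decides voicing and rough pitch together.
rng = np.random.default_rng(1)
labels = rng.integers(0, 3, size=200)
frames = rng.normal(size=(200, 8)) + 3.0 * labels[:, None]
features = stack_context(frames, width=1)
model = ELM(n_hidden=40).fit(features, labels, n_classes=3)
rough = model.predict(features)
```

Because training reduces to a single linear solve, fitting needs no iterative optimisation, which is consistent with the abstract's emphasis on speed. The fine-tuning of rough pitches by nearby spectral peaks is not covered by this sketch.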

Keywords