Journal of King Saud University: Computer and Information Sciences (Sep 2022)
Dissecting the genre of Nigerian music with machine learning models
Abstract
Music Information Retrieval (MIR) is the task of extracting high-level information, such as genre, artist or instrumentation from music. Genre classification is an important and rapidly evolving research area of MIR. To date, only a small amount of research work has been done on the automatic genre classification of Nigerian songs. Hence, this study presents a new music dataset, namely the ORIN dataset, consisting of only Nigerian songs. The study dataset contains 478 Nigerian traditional songs from five genres: fuji, juju, highlife, waka and apala. The timbral texture and tempo features were mined from 30-second segments of each song using the Librosa Python library. For genre classification, the ORIN datasets was trained on 4 different classifiers- k-Nearest Neighbhour, Support Vector Machine, eXtreme Gradient Boosting (XGBoost) and Random Forest- with 85–15 train-test splits. The results obtained for the five different genres indicates that XGBoost classifier is a better model, having the highest accuracy of 81.94% and recall of 84.57%. This study uses the global mean (Tree SHAP) method to determine feature importance and impact on the classification model. Further analysis on the individual genres found some nearness in the timbral properties between some of the genres. This analysis was confirmed by human observation.