Hybrid Feature Selection Method Based on Harmony Search and Naked Mole-Rat Algorithms for Spoken Language Identification From Audio Signals

Samarpan Guha; Aankit Das; Pawan Kumar Singh; Ali Ahmadian; Norazak Senu; Ram Sarkar

doi:10.1109/ACCESS.2020.3028121

IEEE Access (Jan 2020)

Hybrid Feature Selection Method Based on Harmony Search and Naked Mole-Rat Algorithms for Spoken Language Identification From Audio Signals

Samarpan Guha,
Aankit Das,
Pawan Kumar Singh,
Ali Ahmadian,
Norazak Senu,
Ram Sarkar

Affiliations

Samarpan Guha: ORCiD; Institute of Radio Physics and Electronics, University of Calcutta, Kolkata, India
Aankit Das: ORCiD; Institute of Radio Physics and Electronics, University of Calcutta, Kolkata, India
Pawan Kumar Singh: ORCiD; Department of Information Technology, Jadavpur University, Kolkata, India
Ali Ahmadian: ORCiD; Institute of IR 4.0, The National University of Malaysia, Bangi, Malaysia
Norazak Senu: ORCiD; Institute for Mathematical Research, Universiti Putra Malaysia, Seri Kembangan, Malaysia
Ram Sarkar: ORCiD; Department of Computer Science and Engineering, Jadavpur University, Kolkata, India

DOI: https://doi.org/10.1109/ACCESS.2020.3028121
Journal volume & issue: Vol. 8
pp. 182868 – 182887

Abstract

Read online

This era is dominated by artificial intelligence and its various applications - one of which is Spoken Language Identification (S-LID) which has always been a challenging issue and an important research area in the domain of speech signal processing. This paper deals with S-LID to be used for Human-Computer Interaction (HCI) based applications by attempting to classify various languages from three multi-lingual databases namely CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages, VoxForge and Indian Institute of Technology, Madras (IIT-Madras) speech corpus database by extracting their Mel-Spectrogram features and Relative Spectral Transform - Perceptual Linear Prediction (RASTA-PLP) features. A new hybrid Feature Selection (FS) algorithm have been developed using the versatile Harmony Search (HS) algorithm and a new nature-inspired algorithm called Naked Mole-Rat (NMR) algorithm to select the best subset of features and reduce the model complexity to help it train faster. This selected feature set is fed to five classifiers namely Support Vector Machine (SVM), k-Nearest Neighbor (k-NN), Multi-layer Perceptron (MLP), Naïve Bayes (NB) and Random Forest (RF). The evaluation measures used in this paper are precision, recall, f1-score, classification accuracy and number of selected features. An accuracy of 99.89% on CSS10, 98.22% on VoxForge and 99.75% on IIT-Madras speech corpus databases is achieved using RF. Furthermore, the proposed algorithm is found to outperform 15 standard meta-heuristic FS algorithms. The source code of this work is available at: https://github.com/CodeChef97dotcom/HS-NMR.git.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords