IEEE Access (Jan 2020)

A Novel Framework of Two Successive Feature Selection Levels Using Weight-Based Procedure for Voice-Loss Detection in Parkinson’s Disease

  • Amira S. Ashour,
  • Majid Kamal A. Nour,
  • Kemal Polat,
  • Yanhui Guo,
  • Wafaa Alsaggaf,
  • Amira El-Attar

DOI
https://doi.org/10.1109/ACCESS.2020.2989032
Journal volume & issue
Vol. 8
pp. 76193 – 76203

Abstract

Read online

Parkinson's disease (PD) is one of the public neuro-degenerative disorders. Speech/voice disorder is considered one of the symptoms at an early stage. Acoustic and speech signal processing methods can potentially evaluate and measure PD-related vocal impairment. The present work proposed a novel feature selection framework using two levels of the feature selection procedure for voice-loss detection in PD patients. At the first level selection, the principal component analysis (PCA) and the eigenvector centrality feature selection (ECFS) methods are initially calculated independently, and the selected features from each method are considered as a separated sublist, namely ECFS selected features sublist, and PCA selected features sublist, in the first set. Accordingly, the first set, which is the first level selection set, is generated from the union of these two sublists using the top-selected features from both methods. In the training phase, a second level selection, which forms the second set (which is a subset from the first set), is generated to calculate the proposed weight of each selection method. Since in the present work, the ECFS provided superior performance to the PCA in the first level selection, the ECFS is applied to the first set in order to find weight values based on the contribution/impact of the top-selected PCA- and ECFS- features in the second level. This weight is determined by finding a proposed ratio, which is multiplied directly by the selected ECFS features in the first level. The selected weighted ECFS features are then combined with the same PCA features to avoid ignoring any of the top-ranked features from the first level. This combination includes the final weighted-hybrid selected features that fed to a support vector machine (SVM) classifier to evaluate the proposed weighted hybrid selected features. Hence, in the test phase, the generated weight is used directly without any further need for the second level selection. Several comparative studies were conducted to evaluate the proposed feature selection performance for PD voice-loss detection. The experimental results established the superiority of the proposed procedure using cubic kernel-SVM with 94% accuracy for voice-loss detection in PD, while, with the same classifier, 88% accuracy was achieved without using the proposed selection method.

Keywords