Analysis of Smartphone Recordings in Time, Frequency, and Cepstral Domains to Classify Parkinson’s Disease

Ilias Tougui; Abdelilah Jilbab; Jamal El Mhamdi

doi:10.4258/hir.2020.26.4.274

Healthcare Informatics Research (Oct 2020)

Analysis of Smartphone Recordings in Time, Frequency, and Cepstral Domains to Classify Parkinson’s Disease

Ilias Tougui,
Abdelilah Jilbab,
Jamal El Mhamdi

Affiliations

Ilias Tougui: Department of Biomedical Engineering, Mohammed V University in Rabat, Morocco
Abdelilah Jilbab: Department of Biomedical Engineering, Mohammed V University in Rabat, Morocco
Jamal El Mhamdi: Department of Biomedical Engineering, Mohammed V University in Rabat, Morocco

DOI: https://doi.org/10.4258/hir.2020.26.4.274
Journal volume & issue: Vol. 26, no. 4
pp. 274 – 283

Abstract

Read online

Objectives Parkinson’s disease (PD) is the second most common neurodegenerative disorder; it affects more than 10 million people worldwide. Detecting PD usually requires a professional assessment by an expert, and investigation of the voice as a biomarker of the disease could be effective in speeding up the diagnostic process. Methods We present our methodology in which we distinguish PD patients from healthy controls (HC) using a large sample of 18,210 smartphone recordings. Those recordings were processed by an audio processing technique to create a final dataset of 80,594 instances and 138 features from the time, frequency, and cepstral domains. This dataset was preprocessed and normalized to create baseline machine-learning models using four classifiers, namely, linear support vector machine, K-nearest neighbor, random forest, and extreme gradient boosting (XGBoost). We divided our dataset into training and held-out test sets. Then we used stratified 5-fold cross-validation and four performance measures: accuracy, sensitivity, specificity, and F1-score to assess the performance of the models. We applied two feature selection methods, analysis of variance (ANOVA) and least absolute shrinkage and selection operator (LASSO), to reduce the dimensionality of the dataset by selecting the best subset of features that maximizes the performance of the classifiers. Results LASSO outperformed ANOVA with almost the same number of features. With 33 features, XGBoost achieved a maximum accuracy of 95.31% on training data, and 95.78% by predicting unseen data. Conclusions Developing a smartphone-based system that implements machine-learning techniques is an effective way to diagnose PD using the voice as a biomarker.

Published in Healthcare Informatics Research

ISSN: 2093-3681 (Print); 2093-369X (Online)
Publisher: The Korean Society of Medical Informatics
Country of publisher: Korea, Republic of
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: http://www.e-hir.org

About the journal

Abstract

Keywords