CNN AND LSTM FOR THE CLASSIFICATION OF PARKINSON'S DISEASE BASED ON THE GTCC AND MFCC

Nouhaila BOUALOULOU; Taoufiq BELHOUSSINE DRISSI; Benayad NSIRI

doi:10.35784/acs-2023-11

Applied Computer Science (Jun 2023)

Affiliations

Nouhaila BOUALOULOU: Laboratory Electrical and Industrial Engineering, Information Processing, Informatics, and Logistics (GEITIIL), Faculty of Science Ain Chock, University Hassan II, Casablanca, Morocco
Taoufiq BELHOUSSINE DRISSI: Laboratory Electrical and Industrial Engineering, Information Processing, Informatics, and Logistics (GEITIIL), Faculty of Science Ain Chock, University Hassan II, Casablanca, Morocco
Benayad NSIRI: Research Center STIS, M2CS, National Higher School of Arts and Craft, Rabat (ENSAM), Mohammed V University in Rabat, Morocco

DOI: https://doi.org/10.35784/acs-2023-11
Journal volume & issue: Vol. 19, no. 2
pp. 1 – 24

Abstract

Parkinson's disease (PD) is a recognizable clinical syndrome with a variety of causes and clinical presentations, and it is a rapidly growing neurodegenerative disorder. Since about 90 percent of people with Parkinson's disease have some form of early speech impairment, recent studies on telediagnosis of the disease have focused on recognizing voice impairments from vowel phonations or from the subjects' discourse. This paper presents a new approach for detecting Parkinson's disease from speech sounds. The approach is based on CNN and LSTM models and uses two categories of features: Mel Frequency Cepstral Coefficients (MFCC) and Gammatone Cepstral Coefficients (GTCC), both obtained from speech signals denoised with the EMD-DWT and DWT-EMD methods, which are compared. The proposed model is divided into three stages. In the first stage, noise is removed from the signals using the EMD-DWT and DWT-EMD methods. In the second stage, the GTCC and MFCC are extracted from the enhanced audio signals. In the third stage, classification is carried out by feeding these features into the LSTM and CNN models, which are designed to capture sequential information from the extracted features. The experiments are performed on the PC-GITA and Sakar datasets using 10-fold cross-validation. The highest classification accuracy on the Sakar dataset reached 100% for both EMD-DWT-GTCC-CNN and DWT-EMD-GTCC-CNN; on the PC-GITA dataset, accuracy reached 100% for EMD-DWT-GTCC-CNN and 96.55% for DWT-EMD-GTCC-CNN. The results of this study indicate that GTCC features are more appropriate and accurate for the assessment of PD than MFCC features.
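The 10-fold cross-validation protocol mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the authors' code: the function names (`k_fold_indices`, `cross_validate`, `majority_baseline`) are hypothetical, the split is a plain shuffled split over recordings (the abstract does not say whether folds were stratified or grouped by speaker), and the placeholder classifier stands in for the CNN and LSTM models, which are not specified here.

```python
import random


def k_fold_indices(n_samples, k=10, seed=0):
    """Split range(n_samples) into k shuffled folds of near-equal size."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Fold i takes every k-th index starting at i, so sizes differ by at most 1.
    return [indices[i::k] for i in range(k)]


def cross_validate(features, labels, train_and_score, k=10, seed=0):
    """Return the mean accuracy over k folds (assumes len(features) >= k).

    train_and_score is a hypothetical callback that receives
    (train_x, train_y, test_x, test_y) and returns an accuracy in [0, 1].
    """
    folds = k_fold_indices(len(features), k, seed)
    scores = []
    for i, test_idx in enumerate(folds):
        # Every fold except fold i is used for training.
        train_idx = [j for f, fold in enumerate(folds) if f != i for j in fold]
        scores.append(train_and_score(
            [features[j] for j in train_idx], [labels[j] for j in train_idx],
            [features[j] for j in test_idx], [labels[j] for j in test_idx],
        ))
    return sum(scores) / len(scores)


def majority_baseline(train_x, train_y, test_x, test_y):
    """Placeholder model: always predicts the most common training label."""
    majority = max(set(train_y), key=train_y.count)
    return sum(1 for y in test_y if y == majority) / len(test_y)
```

In a real experiment, `features` would hold the GTCC or MFCC vectors extracted from the denoised recordings, and `train_and_score` would train and evaluate the chosen network on each fold. With several recordings per speaker, grouping folds by speaker is a common precaution against optimistic accuracy estimates.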

Published in Applied Computer Science

ISSN: 1895-3735 (Print); 2353-6977 (Online)
Publisher: Polish Association for Knowledge Promotion
Country of publisher: Poland
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://www.acs.pollub.pl/
