Doklady Belorusskogo gosudarstvennogo universiteta informatiki i radioèlektroniki (Jun 2023)

IT Diagnostics of Parkinson’s Disease Based on the Analysis of Voice Markers and Machine Learning

  • U. A. Vishniakou,
  • Xia YiWei

DOI
https://doi.org/10.35596/1729-7648-2023-21-3-102-110
Journal volume & issue
Vol. 21, no. 3
pp. 102 – 110

Abstract

Read online

The results of studying the parameters of the spectra of speech signals by machine learning with the use of neural networks are presented. This study was carried out in order to confirm experimentally the possibility of performing an assessment of these parameters for the detection of Parkinson’s disease in the early stages (IT diagnostics). During the study, the public database was used, which systematized the spectra of vowel sounds uttered by patients with Parkinson’s disease. The applied method is binary data classification. In the course of the study, the speech data spectrum was first preprocessed, which consisted of filtering it in order to remove its noise components and eliminate bursts and gaps in it. Then the parameters of the processed spectrum of speech data were determined: average value, maximum and minimum, peak, wavelet coefficients, MFCC and TQWT. After that, the object was selected using the PCA algorithm. The model was trained using the Knn and Random Forest algorithms, as well as the Bayesian neural network. The Bayesian optimization algorithm and the GridSearch method were used to find the best model hyperparameters. It has been established that when using Knn, Random Forest and Bayesian neural network, it is possible to increase the accuracy of recognition of Parkinson’s disease by 94.7; 88.16 and 74.74 %, respectively. A similar study by other scientists showed that the recognition accuracy of data sets was only 86 %.

Keywords