A Robust Voice Pathology Detection System Based on the Combined BiLSTM–CNN Architecture

Rimah Amami; Rim Amami; Chiraz Trabelsi; Sherin Hassan Mabrouk; Hassan A. Khalil

doi:10.13164/mendel.2023.2.202

Mendel (Dec 2023)

A Robust Voice Pathology Detection System Based on the Combined BiLSTM–CNN Architecture

Rimah Amami,
Rim Amami,
Chiraz Trabelsi,
Sherin Hassan Mabrouk,
Hassan A. Khalil

Affiliations

Rimah Amami: Computer Department, Deanship of Preparatory Year and Supporting Studies, Imam AbdulRahman bin Faisal University, Dammam, KSA
Rim Amami: Basic Science Department, Deanship of Preparatory Year and Supporting Studies, Imam AbdulRahman bin Faisal University, Dammam, KSA
Chiraz Trabelsi: Institut Montpellierain Alexander Grothendieck, UMR CNRS 5149, Place Eugene Bataillon, 34090, Montpellier, France
Sherin Hassan Mabrouk: Self-Development Department, Deanship of Preparatory Year and Supporting Studies, Imam AbdulRahman bin Faisal University, Dammam, KSA
Hassan A. Khalil: Department of Mathematics, Faculty of Science, Zagazig University, Zagazig 44519, Egypt

DOI: https://doi.org/10.13164/mendel.2023.2.202
Journal volume & issue: Vol. 29, no. 2

Abstract

Read online

Voice recognition systems have become increasingly important in recent years due to the growing need for more efficient and intuitive human-machine interfaces. The use of Hybrid LSTM networks and deep learning has been very successful in improving speech detection systems. The aim of this paper is to develop a novel approach for the detection of voice pathologies using a hybrid deep learning model that combines the Bidirectional Long Short-Term Memory (BiLSTM) and the Convolutional Neural Network (CNN) architectures. The proposed model uses a combination of temporal and spectral features extracted from speech signals to detect the different types of voice pathologies. The performance of the proposed detection model is evaluated on a publicly available dataset of speech signals from individuals with various voice pathologies(MEEI database). The experimental results showed that the hybrid BiLSTM-CNN model outperforms several classifiers by achieving an accuracy of 98.86\%. The proposed model has the potential to assist health care professionals in the accurate diagnosis and treatment of voice pathologies, and improving the quality of life for affected individuals.

Published in Mendel

ISSN: 1803-3814 (Print); 2571-3701 (Online)
Publisher: Brno University of Technology
Country of publisher: Czechia
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://mendel-journal.org

About the journal

Abstract

Keywords