Pay attention to the speech: COVID-19 diagnosis using machine learning and crowdsourced respiratory and speech recordings

Mahmoud Aly; Kamel H. Rahouma; Safwat M. Ramzy

Alexandria Engineering Journal (May 2022)

Pay attention to the speech: COVID-19 diagnosis using machine learning and crowdsourced respiratory and speech recordings

Mahmoud Aly,
Kamel H. Rahouma,
Safwat M. Ramzy

Affiliations

Mahmoud Aly: Department of Electrical Engineering, Faculty of Engineering, Minia University, Minia, Egypt; Corresponding author at: Department of Electrical Engineering, Faculty of Engineering, Minia University, 61519 Minia, Egypt.
Kamel H. Rahouma: Department of Electrical Engineering, Faculty of Engineering, Minia University, Minia, Egypt
Safwat M. Ramzy: Department of Electrical Engineering, Faculty of Engineering, Sohag University, Sohag, Egypt

Journal volume & issue: Vol. 61, no. 5
pp. 3487 – 3500

Abstract

Read online

Since the outbreak of COVID-19, many efforts have been made to utilize the respiratory sounds and coughs collected by smartphones for training Machine Learning models to classify and distinguish COVID-19 sounds from healthy ones. Embedding those models into mobile applications or Internet of things devices can make effective COVID-19 pre-screening tools afforded by anyone anywhere. Most of the previous researchers trained their classifiers with respiratory sounds such as breathing or coughs, and they achieved promising results. We claim that using special voice patterns besides other respiratory sounds can achieve better performance. In this study, we used the Coswara dataset where each user has recorded 9 different types of sounds as cough, breathing, and speech labeled with COVID-19 status. A combination of models trained on different sounds can diagnose COVID-19 more accurately than a single model trained on cough or breathing only. Our results show that using simple binary classifiers can achieve an AUC of 96.4% and an accuracy of 96% by averaging the predictions of multiple models trained and evaluated separately on different sound types. Finally, this study aims to draw attention to the importance of the human voice alongside other respiratory sounds for the sound-based COVID-19 diagnosis.

Published in Alexandria Engineering Journal

ISSN: 1110-0168 (Print); 2090-2670 (Online)
Publisher: Elsevier
Country of publisher: Egypt
LCC subjects: Technology: Engineering (General). Civil engineering (General)
Website: http://www.journals.elsevier.com/alexandria-engineering-journal/

About the journal

Abstract

Keywords