Identifying individuals with recent COVID-19 through voice classification using deep learning

Pichatorn Suppakitjanusant; Somnuek Sungkanuparph; Thananya Wongsinin; Sirapong Virapongsiri; Nittaya Kasemkosin; Laor Chailurkit; Boonsong Ongphiphadhanakul

doi:10.1038/s41598-021-98742-x

Scientific Reports (Sep 2021)

Identifying individuals with recent COVID-19 through voice classification using deep learning

Pichatorn Suppakitjanusant,
Somnuek Sungkanuparph,
Thananya Wongsinin,
Sirapong Virapongsiri,
Nittaya Kasemkosin,
Laor Chailurkit,
Boonsong Ongphiphadhanakul

Affiliations

Pichatorn Suppakitjanusant: Chakri Naruebodindra Medical Institute, Faculty of Medicine Ramathibodi Hospital, Mahidol University
Somnuek Sungkanuparph: Chakri Naruebodindra Medical Institute, Faculty of Medicine Ramathibodi Hospital, Mahidol University
Thananya Wongsinin: Chakri Naruebodindra Medical Institute, Faculty of Medicine Ramathibodi Hospital, Mahidol University
Sirapong Virapongsiri: Chakri Naruebodindra Medical Institute, Faculty of Medicine Ramathibodi Hospital, Mahidol University
Nittaya Kasemkosin: Department of Communication Sciences and Disorders, Faculty of Medicine Ramathibodi Hospital, Mahidol University
Laor Chailurkit: Division of Endocrinology and Metabolism, Department of Medicine, Faculty of Medicine Ramathibodi Hospital, Mahidol University
Boonsong Ongphiphadhanakul: Division of Endocrinology and Metabolism, Department of Medicine, Faculty of Medicine Ramathibodi Hospital, Mahidol University

DOI: https://doi.org/10.1038/s41598-021-98742-x
Journal volume & issue: Vol. 11, no. 1
pp. 1 – 7

Abstract

Read online

Abstract Recently deep learning has attained a breakthrough in model accuracy for the classification of images due mainly to convolutional neural networks. In the present study, we attempted to investigate the presence of subclinical voice feature alteration in COVID-19 patients after the recent resolution of disease using deep learning. The study was a prospective study of 76 post COVID-19 patients and 40 healthy individuals. The diagnoses of post COVID-19 patients were based on more than the eighth week after onset of symptoms. Voice samples of an ‘ah’ sound, coughing sound and a polysyllabic sentence were collected and preprocessed to log-mel spectrogram. Transfer learning using the VGG19 pre-trained convolutional neural network was performed with all voice samples. The performance of the model using the polysyllabic sentence yielded the highest classification performance of all models. The coughing sound produced the lowest classification performance while the ability of the monosyllabic ‘ah’ sound to predict the recent COVID-19 fell between the other two vocalizations. The model using the polysyllabic sentence achieved 85% accuracy, 89% sensitivity, and 77% specificity. In conclusion, deep learning is able to detect the subtle change in voice features of COVID-19 patients after recent resolution of the disease.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal