Artificial intelligence enabled smart mask for speech recognition for future hearing devices

Hira Hameed; Lubna; Muhammad Usman; Jalil Ur Rehman Kazim; Khaled Assaleh; Kamran Arshad; Amir Hussain; Muhammad Imran; Qammer H. Abbasi

doi:10.1038/s41598-024-81904-y

Scientific Reports (Dec 2024)

Artificial intelligence enabled smart mask for speech recognition for future hearing devices

Hira Hameed,
Lubna,
Muhammad Usman,
Jalil Ur Rehman Kazim,
Khaled Assaleh,
Kamran Arshad,
Amir Hussain,
Muhammad Imran,
Qammer H. Abbasi

Affiliations

Hira Hameed: James Watt School of Engineering, University of Glasgow
Lubna: James Watt School of Engineering, University of Glasgow
Muhammad Usman: School of Computing, Engineering and Built Environment, Glasgow Caledonian University
Jalil Ur Rehman Kazim: James Watt School of Engineering, University of Glasgow
Khaled Assaleh: Department of Electrical and Computer Engineering, College of Engineering and Information Technology, Ajman University
Kamran Arshad: Department of Electrical and Computer Engineering, College of Engineering and Information Technology, Ajman University
Amir Hussain: School of Computing, Edinburgh Napier University
Muhammad Imran: James Watt School of Engineering, University of Glasgow
Qammer H. Abbasi: James Watt School of Engineering, University of Glasgow

DOI: https://doi.org/10.1038/s41598-024-81904-y
Journal volume & issue: Vol. 14, no. 1
pp. 1 – 11

Abstract

Read online

Abstract In recent years, Lip-reading has emerged as a significant research challenge. The aim is to recognise speech by analysing Lip movements. The majority of Lip-reading technologies are based on cameras and wearable devices. However, these technologies have well-known occlusion and ambient lighting limitations, privacy concerns as well as wearable device discomfort for subjects and disturb their daily routines. Furthermore, in the era of coronavirus (COVID-19), where face masks are the norm, vision-based and wearable-based technologies for hearing aids are ineffective. To address the fundamental limitations of camera-based and wearable-based systems, this paper proposes a Radio Frequency Identification (RFID)-based smart mask for a Lip-reading framework capable of reading Lips under face masks, enabling effective speech recognition and fostering conversational accessibility for individuals with hearing impairment. The system uses RFID technology to make Radio Frequency (RF) sensing-based Lip-reading possible. A smart RFID face mask is used to collect a dataset containing three different classes of vowels (A, E, I, O, U), Consonants (F, G, M, S), and words (Fish, Goat, Meal, Moon, Snake). The collected data are fed into well-known machine-learning models for classification. A high classification accuracy is achieved by individual classes and combined datasets. On the RFID combined dataset, the Random Forest model achieves a high classification accuracy of 80%.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal