Deep learning based assistive technology on audio visual speech recognition for hearing impaired

L Ashok Kumar; D Karthika Renuka; S Lovelyn Rose; M C Shunmuga Priya; I Made Wartana

International Journal of Cognitive Computing in Engineering (Jun 2022)

Affiliations

L Ashok Kumar: Department of Electrical and Electronics Engineering, PSG College of Technology, Coimbatore, India; Corresponding author.
D Karthika Renuka: Department of Information Technology, PSG College of Technology, Coimbatore, India
S Lovelyn Rose: Department of Computer Science and Engineering, PSG College of Technology, Coimbatore, India
M C Shunmuga Priya: Department of Information Technology, PSG College of Technology, Coimbatore, India
I Made Wartana: Department of Electrical Engineering, National Institute of Technology (ITN), Indonesia

Journal volume & issue: Vol. 3, pp. 24–30

Abstract

Audio Visual Speech Recognition (AVSR) can be of immense benefit as an assistive technology for hearing-impaired people. Around 466 million people worldwide suffer from hearing loss. Hearing-impaired students rely on lip reading to understand speech. A lack of trained sign language facilitators and the high cost of assistive devices are among the major challenges they face. In this work, we identify a visual speech recognition (VSR) technique using cutting-edge deep learning models. Existing VSR techniques, however, are error-prone. To address the gaps identified, we propose a novel technique that fuses the results from audio and visual speech. This study proposes a new deep learning based audio visual speech recognition model for efficient lip reading. The system achieves a reduced word error rate of about 6.59% for the automatic speech recognition (ASR) system and an accuracy of about 95% for the lip reading model.
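
For readers unfamiliar with the metric, word error rate (WER) is conventionally defined as the word-level edit distance between a reference transcript and the recognizer output, divided by the number of reference words. The sketch below illustrates that standard definition only; it is not the authors' evaluation code, and the function name `word_error_rate` and the whitespace tokenization are assumptions made for illustration.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Illustrative helper (hypothetical name); tokenization is a plain
    whitespace split, which is an assumption, not the paper's method.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1        # reference word missing from output
            insertion = curr[j - 1] + 1   # extra word in output
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution and one deletion against four reference words.
    print(word_error_rate("a b c d", "a x c"))
```

Under this definition, a WER of about 6.59% corresponds to roughly 6 to 7 word-level errors (substitutions, deletions, or insertions) per 100 reference words.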

Published in International Journal of Cognitive Computing in Engineering

ISSN: 2666-3074 (Online)
Publisher: KeAi Communications Co., Ltd.
Country of publisher: China
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.keaipublishing.com/en/journals/international-journal-of-cognitive-computing-in-engineering/
