Multi-Semantic Discriminative Feature Learning for Sign Gesture Recognition Using Hybrid Deep Neural Architecture

E. Rajalakshmi; R. Elakkiya; V. Subramaniyaswamy; L. Prikhodko Alexey; Grif Mikhail; Maxim Bakaev; Ketan Kotecha; Lubna Abdelkareim Gabralla; Ajith Abraham

doi:10.1109/ACCESS.2022.3233671

IEEE Access (Jan 2023)

Affiliations

E. Rajalakshmi: School of Computing, SASTRA Deemed University, Thanjavur, Tamil Nadu, India
R. Elakkiya: School of Computing, SASTRA Deemed University, Thanjavur, Tamil Nadu, India
V. Subramaniyaswamy: School of Computing, SASTRA Deemed University, Thanjavur, Tamil Nadu, India
L. Prikhodko Alexey: Department of Automated Control Systems, Novosibirsk State Technical University, Novosibirsk, Russia
Grif Mikhail: Department of Automated Control Systems, Novosibirsk State Technical University, Novosibirsk, Russia
Maxim Bakaev: Department of Automated Control Systems, Novosibirsk State Technical University, Novosibirsk, Russia
Ketan Kotecha: Symbiosis Centre for Applied Artificial Intelligence, Symbiosis International (Deemed University), Pune, India
Lubna Abdelkareim Gabralla: Department of Computer Science and Information Technology, College of Applied, Princess Nourah Bint Abdulrahman University, Riyadh, Saudi Arabia
Ajith Abraham: Faculty of Computing and Data Sciences, FLAME University, Pune, India

DOI: https://doi.org/10.1109/ACCESS.2022.3233671
Journal volume & issue: Vol. 11
pp. 2226 – 2238

Abstract

The speech- and hearing-impaired community uses sign language as its primary means of communication. It is quite challenging for the general population to interpret or learn sign language completely. A sign language recognition system must be designed and developed to address this communication barrier. Most current sign language recognition systems rely on wearable sensors, which keeps them unaffordable for most individuals. Moreover, existing vision-based sign recognition frameworks do not consider all of the spatial and temporal information required for accurate recognition. A novel vision-based hybrid deep neural net methodology is proposed in this study for recognizing Indian and Russian sign gestures. The proposed methodology aims to establish a single framework for tracking and extracting multi-semantic properties, such as non-manual components and manual co-articulations. Spatial features are extracted from the sign gestures using a 3D deep neural net with atrous convolutions. Temporal and sequential features are extracted with an attention-based Bi-LSTM. In addition, distinguished abstract features are extracted using modified autoencoders. Discriminative features that differentiate sign gestures from unwanted transition gestures are extracted by leveraging a hybrid attention module. The proposed model was evaluated on the novel multi-signer Indo-Russian sign language dataset. The proposed sign language recognition framework with the hybrid neural net yields better results than other state-of-the-art frameworks.
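
As an illustration of the atrous (dilated) convolution mentioned in the abstract, the following is a minimal sketch in plain Python. It is not the authors' implementation: the paper applies atrous convolutions inside a 3D network, whereas this example works on a 1D sequence to keep the idea visible. The function names, the input values, and the 3-tap kernel are all hypothetical. The point it shows is that inserting gaps between kernel taps widens the span each output covers without adding weights.

```python
def effective_kernel_size(kernel_size: int, dilation: int) -> int:
    """Span covered by a dilated kernel: k + (k - 1) * (d - 1)."""
    return kernel_size + (kernel_size - 1) * (dilation - 1)


def atrous_conv1d(signal, kernel, dilation=1):
    """Valid (no padding) 1D atrous convolution, computed as cross-correlation."""
    span = effective_kernel_size(len(kernel), dilation)
    out_len = len(signal) - span + 1
    if out_len <= 0:
        raise ValueError("signal is shorter than the dilated kernel span")
    # Each output sums kernel taps applied to inputs spaced `dilation` apart.
    return [
        sum(w * signal[start + j * dilation] for j, w in enumerate(kernel))
        for start in range(out_len)
    ]


# Hypothetical per-frame feature values and a hypothetical 3-tap filter.
frames = [1, 2, 3, 4, 5, 6, 7, 8]
kernel = [1, 0, -1]

dense = atrous_conv1d(frames, kernel, dilation=1)   # each output spans 3 frames
atrous = atrous_conv1d(frames, kernel, dilation=2)  # each output spans 5 frames
```

With a dilation of 2, the same three weights compare frames four steps apart instead of two, which is why dilation is often used to capture wider context at the same parameter cost.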

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639
