Empowering Deaf-Hearing Communication: Exploring Synergies between Predictive and Generative AI-Based Strategies towards (<i>Portuguese</i>) Sign Language Interpretation

Telmo Adão; João Oliveira; Somayeh Shahrabadi; Hugo Jesus; Marco Fernandes; Ângelo Costa; Vânia Ferreira; Martinho Fradeira Gonçalves; Miguel A. Guevara Lopéz; Emanuel Peres; Luís Gonzaga Magalhães

doi:10.3390/jimaging9110235

Journal of Imaging (Oct 2023)

Empowering Deaf-Hearing Communication: Exploring Synergies between Predictive and Generative AI-Based Strategies towards (<i>Portuguese</i>) Sign Language Interpretation

Telmo Adão,
João Oliveira,
Somayeh Shahrabadi,
Hugo Jesus,
Marco Fernandes,
Ângelo Costa,
Vânia Ferreira,
Martinho Fradeira Gonçalves,
Miguel A. Guevara Lopéz,
Emanuel Peres,
Luís Gonzaga Magalhães

Affiliations

Telmo Adão: Department of Engineering, School of Sciences and Technology, University of Trás-os-Montes e Alto Douro, 5000-801 Vila Real, Portugal
João Oliveira: Centro de Computação Gráfica-CCG/zgdv, University of Minho, Campus de Azurém, Edifício 14, 4800-058 Guimarães, Portugal
Somayeh Shahrabadi: Centro de Computação Gráfica-CCG/zgdv, University of Minho, Campus de Azurém, Edifício 14, 4800-058 Guimarães, Portugal
Hugo Jesus: Centro de Computação Gráfica-CCG/zgdv, University of Minho, Campus de Azurém, Edifício 14, 4800-058 Guimarães, Portugal
Marco Fernandes: Polytechnic Institute of Bragança, School of Communication, Administration and Tourism, Campus do Cruzeiro, 5370-202 Mirandela, Portugal
Ângelo Costa: Associação Portuguesa de Surdos (APS), 1600-796 Lisboa, Portugal
Vânia Ferreira: Associação Portuguesa de Surdos (APS), 1600-796 Lisboa, Portugal
Martinho Fradeira Gonçalves: Polytechnic Institute of Bragança, School of Communication, Administration and Tourism, Campus do Cruzeiro, 5370-202 Mirandela, Portugal
Miguel A. Guevara Lopéz: ALGORITMI Research Centre/LASI, University of Minho, 4800-058 Guimarães, Portugal
Emanuel Peres: Department of Engineering, School of Sciences and Technology, University of Trás-os-Montes e Alto Douro, 5000-801 Vila Real, Portugal
Luís Gonzaga Magalhães: ALGORITMI Research Centre/LASI, University of Minho, 4800-058 Guimarães, Portugal

DOI: https://doi.org/10.3390/jimaging9110235
Journal volume & issue: Vol. 9, no. 11
p. 235

Abstract

Read online

Communication between Deaf and hearing individuals remains a persistent challenge requiring attention to foster inclusivity. Despite notable efforts in the development of digital solutions for sign language recognition (SLR), several issues persist, such as cross-platform interoperability and strategies for tokenizing signs to enable continuous conversations and coherent sentence construction. To address such issues, this paper proposes a non-invasive Portuguese Sign Language (Língua Gestual Portuguesa or LGP) interpretation system-as-a-service, leveraging skeletal posture sequence inference powered by long-short term memory (LSTM) architectures. To address the scarcity of examples during machine learning (ML) model training, dataset augmentation strategies are explored. Additionally, a buffer-based interaction technique is introduced to facilitate LGP terms tokenization. This technique provides real-time feedback to users, allowing them to gauge the time remaining to complete a sign, which aids in the construction of grammatically coherent sentences based on inferred terms/words. To support human-like conditioning rules for interpretation, a large language model (LLM) service is integrated. Experiments reveal that LSTM-based neural networks, trained with 50 LGP terms and subjected to data augmentation, achieved accuracy levels ranging from 80% to 95.6%. Users unanimously reported a high level of intuition when using the buffer-based interaction strategy for terms/words tokenization. Furthermore, tests with an LLM—specifically ChatGPT—demonstrated promising semantic correlation rates in generated sentences, comparable to expected sentences.

Published in Journal of Imaging

ISSN: 2313-433X (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Photography; Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://www.mdpi.com/journal/jimaging

About the journal

Abstract

Keywords