This research proposes a system to recognize the static signs of the Mexican Sign Language (MSL) dactylological alphabet, using the MediaPipe framework and Convolutional Neural Network (CNN) models to interpret the letters represented by hand signs captured by a camera. Studies of this kind bring advances in artificial intelligence and computer vision to the teaching of MSL. The best CNN model achieved an accuracy of 83.63% on a test set of 336 images. In addition, evaluated on samples of each letter, the system obtained an accuracy of 84.57%, a sensitivity of 83.33%, and a specificity of 99.17%. An advantage of this system is that it can run on low-power hardware and perform the classification in real time, contributing to the accessibility of its use.
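
To make the described pipeline concrete, the sketch below shows one plausible way to combine MediaPipe hand detection with a trained CNN for real-time letter classification from a webcam. It is a minimal sketch under stated assumptions, not the paper's implementation: the model file `msl_cnn.h5`, the 64x64 input size, and the `MSL_LETTERS` label list are placeholders, and the paper does not specify how the CNN input is formed (here, a crop around the MediaPipe hand landmarks is assumed).

```python
import cv2
import mediapipe as mp
import numpy as np
import tensorflow as tf

# Placeholder label set; the actual static-letter subset is defined by the paper.
MSL_LETTERS = [chr(c) for c in range(ord("A"), ord("Z") + 1)]

# Hypothetical trained CNN; file name and 64x64 input size are assumptions.
model = tf.keras.models.load_model("msl_cnn.h5")

mp_hands = mp.solutions.hands

with mp_hands.Hands(static_image_mode=False, max_num_hands=1,
                    min_detection_confidence=0.5) as hands:
    cap = cv2.VideoCapture(0)
    while cap.isOpened():
        ok, frame = cap.read()
        if not ok:
            break
        # MediaPipe expects RGB input; OpenCV captures BGR.
        results = hands.process(cv2.cvtColor(frame, cv2.COLOR_BGR2RGB))
        if results.multi_hand_landmarks:
            lm = results.multi_hand_landmarks[0].landmark
            h, w, _ = frame.shape
            xs = [p.x * w for p in lm]
            ys = [p.y * h for p in lm]
            # Crop a region around the hand landmarks with a small margin.
            x0, x1 = int(max(min(xs) - 20, 0)), int(min(max(xs) + 20, w))
            y0, y1 = int(max(min(ys) - 20, 0)), int(min(max(ys) + 20, h))
            if x1 > x0 and y1 > y0:
                crop = cv2.resize(frame[y0:y1, x0:x1], (64, 64))
                # Normalize to [0, 1] and classify the single-frame crop.
                probs = model.predict(crop[np.newaxis] / 255.0, verbose=0)[0]
                letter = MSL_LETTERS[int(np.argmax(probs))]
                cv2.putText(frame, letter, (x0, max(y0 - 10, 20)),
                            cv2.FONT_HERSHEY_SIMPLEX, 1, (0, 255, 0), 2)
        cv2.imshow("MSL", frame)
        if cv2.waitKey(1) & 0xFF == 27:  # Esc to quit
            break
    cap.release()
cv2.destroyAllWindows()
```

Because MediaPipe runs efficiently on CPU and the CNN only needs to classify a small cropped image per frame, a pipeline of this shape is consistent with the abstract's claim of real-time operation on low-power hardware.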