Technologies (Aug 2024)

MediaPipe Frame and Convolutional Neural Networks-Based Fingerspelling Detection in Mexican Sign Language

  • Tzeico J. Sánchez-Vicinaiz,
  • Enrique Camacho-Pérez,
  • Alejandro A. Castillo-Atoche,
  • Mayra Cruz-Fernandez,
  • José R. García-Martínez,
  • Juvenal Rodríguez-Reséndiz

DOI
https://doi.org/10.3390/technologies12080124
Journal volume & issue
Vol. 12, no. 8
p. 124

Abstract

Read online

This research proposes implementing a system to recognize the static signs of the Mexican Sign Language (MSL) dactylological alphabet using the MediaPipe frame and Convolutional Neural Network (CNN) models to correctly interpret the letters that represent the manual signals coming from a camera. The development of these types of studies allows the implementation of technological advances in artificial intelligence and computer vision in teaching Mexican Sign Language (MSL). The best CNN model achieved an accuracy of 83.63% over the sets of 336 test images. In addition, considering samples of each letter, the following results are obtained: an accuracy of 84.57%, a sensitivity of 83.33%, and a specificity of 99.17%. The advantage of this system is that it could be implemented on low-consumption equipment, carrying out the classification in real-time, contributing to the accessibility of its use.

Keywords