Applied Sciences (Oct 2022)

Dataset Transformation System for Sign Language Recognition Based on Image Classification Network

  • Sang-Geun Choi,
  • Yeonji Park,
  • Chae-Bong Sohn

DOI
https://doi.org/10.3390/app121910075
Journal volume & issue
Vol. 12, no. 19
p. 10075

Abstract

Read online

Among the various fields where deep learning is used, there are challenges to be solved in motion recognition. One is that it is difficult to manage because of the vast amount of data. Another is that it takes a long time to learn due to the complex network and the large amount of data. To solve the problems, we propose a dataset transformation system. Sign language recognition was implemented to evaluate the performance of this system. The system consists of three steps: pose estimation, normalization, and spatial–temporal map (STmap) generation. STmap is a method of simultaneously expressing temporal data and spatial data in one image. In addition, the accuracy of the model was improved, and the error sensitivity was lowered through the data augmentation process. Through the proposed method, it was possible to reduce the dataset from 94.39 GB to 954 MB. It corresponds to approximately 1% of the original. When the dataset created through the proposed method is trained on the image classification model, the sign language recognition accuracy is 84.5%.

Keywords