Improving sign language processing via few-shot machine learning

Grigory F. Shovkoplias; Dmitriy A. Strokov; Daniil V. Kasantsev; Aleksandra S. Vatian; Arip A. Asadulaev; Ivan V. Tomilov; Anatoly A. Shalyto; Natalia F. Gusarova

doi:10.17586/2226-1494-2022-22-3-559-566

Naučno-tehničeskij Vestnik Informacionnyh Tehnologij, Mehaniki i Optiki (Jun 2022)

Improving sign language processing via few-shot machine learning

Grigory F. Shovkoplias,
Dmitriy A. Strokov,
Daniil V. Kasantsev,
Aleksandra S. Vatian,
Arip A. Asadulaev,
Ivan V. Tomilov,
Anatoly A. Shalyto,
Natalia F. Gusarova

Affiliations

Grigory F. Shovkoplias: ORCiD; Engineer, ITMO University, Saint Petersburg, 197101, Russian Federation, sc 57222048908
Dmitriy A. Strokov: ORCiD; Student, ITMO University, Saint Petersburg, 197101, Russian Federation
Daniil V. Kasantsev: ORCiD; Senior Laboratory Assistant, ITMO University, Saint Petersburg, 197101, Russian Federation
Aleksandra S. Vatian: ORCiD; Associate Professor, ITMO University, Saint Petersburg, 197101, Russian Federation, sc 57191870868
Arip A. Asadulaev: ORCiD; Assistant, ITMO University, Saint Petersburg, 197101, Russian Federation
Ivan V. Tomilov: ORCiD; Senior Laboratory Assistant, ITMO University, Saint Petersburg, 197101, Russian Federation
Anatoly A. Shalyto: ORCiD; D. Sc., Full Professor, ITMO University, Saint Petersburg, 197101, Russian Federation, sc 57222048908
Natalia F. Gusarova: ORCiD; PhD, Senior Researcher, Associate Professor, ITMO University, Saint Petersburg, 197101, Russian Federation, sc 57162764200

DOI: https://doi.org/10.17586/2226-1494-2022-22-3-559-566
Journal volume & issue: Vol. 22, no. 3
pp. 559 – 566

Abstract

Read online

Improving the efficiency of communication of deaf and hard of hearing people by processing sign language using artificial intelligence is an important task both socially and technologically. One of the ways to solve this problem is a fairly cheap and accessible marker method. The method is based on the registration of electromyographic (EMG) muscle signals using bracelets worn on the arm. To improve the quality of recognition of gestures recorded by the marker method, a modification of the marker method is proposed — duplication of EMG sensors in combination with a lowframe machine learning approach. We experimentally study the possibilities of improving the quality of processing of sign language by duplicating EMG sensors as well as by reducing the volume of the dataset required for training machine learning tools. In the latter case, we compare several technologies of the few-shot approach. Our experiments show that training with few-shot neural nets on 56k samples we can achieve better results than training on random forest with 160k samples. The use of a minimum number of sensors in combination with few-shot signal processing techniques provides the possibility of organizing quick and cost-effective interaction with people with hearing and speech disabilities.

Published in Naučno-tehničeskij Vestnik Informacionnyh Tehnologij, Mehaniki i Optiki

ISSN: 2226-1494 (Print); 2500-0373 (Online)
Publisher: Saint Petersburg National Research University of Information Technologies, Mechanics and Optics (ITMO University)
Country of publisher: Russian Federation
LCC subjects: Science: Physics: Optics. Light; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://ntv.ifmo.ru/en/english.htm

About the journal

Abstract

Keywords