IEEE Access (Jan 2024)

Single Shot Detector CNN and Deep Dilated Masks for Vision-Based Hand Gesture Recognition From Video Sequences

  • Fahmid Al Farid,
  • Noramiza Hashim,
  • Junaidi Bin Abdullah,
  • Md. Roman Bhuiyan,
  • Magzhan Kairanbay,
  • Zulfadzli Yusoff,
  • Hezerul Abdul Karim,
  • Sarina Mansor,
  • Md. Tanjil Sarker,
  • Gobbi Ramasamy

DOI
https://doi.org/10.1109/ACCESS.2024.3360857
Journal volume & issue
Vol. 12
pp. 28564–28574

Abstract

With the world's population growing, innovative human-computer interaction technologies can help individuals lead more fulfilling lives. Gesture-based technology has the potential to improve the safety and well-being of impaired people as well as the general population. Recognizing gestures from video streams is a difficult problem because the characteristics of each motion vary widely across individuals. In this article, we propose deep learning methods for automated hand gesture recognition from RGB and depth data; either modality can be used to train neural networks to detect hand gestures. Gesture-based interfaces are more natural, intuitive, and straightforward than conventional ones. Earlier studies have attempted to characterize hand motions in a number of contexts. Our technique is evaluated within a vision-based gesture recognition system. In the proposed technique, image acquisition starts with RGB video and depth information captured by the Kinect sensor, followed by hand tracking with a single shot detector convolutional neural network (SSD-CNN). The kernel produces an output value at each of the $m \times n$ locations, and each new feature layer generates a fixed set of gesture detection predictions from a collection of convolutional filters. We then apply deep dilation to make the gesture more visible in the image masks. Finally, hand gestures are classified with the well-known support vector machine (SVM). Using deep learning, we recognize hand gestures with accuracies of 93.68% on the RGB modality, 83.45% on the depth modality, and 90.61% on the RGB-D combination for the SKIG dataset, exceeding the state of the art. On our own Different Camera Orientation Gesture (DCOG) dataset, we achieve 92.78% on the RGB modality, 79.55% on the depth modality, and 88.56% on the RGB-D combination for gestures captured at a 0-degree camera angle. Moreover, the framework aims to use these methodologies to construct a superior vision-based hand gesture recognition system.
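The dilation step described above can be illustrated with a minimal sketch. The paper does not publish its exact implementation, so the function below is an assumption: a binary dilation whose structuring element is sampled at a spacing (`rate`) greater than one, in the spirit of dilated (atrous) kernels, so that the hand mask grows and becomes more visible with a wider reach per pass. All names (`dilate_mask`, `rate`, the toy mask) are illustrative, not from the paper.

```python
import numpy as np

def dilate_mask(mask, kernel_size=3, rate=2):
    """Binary dilation with an atrous structuring element.

    Neighbours are sampled `rate` pixels apart, so a 3x3 kernel with
    rate=2 covers a 5x5 neighbourhood at 9 sample points -- a simple
    stand-in for the 'deep dilation' applied to the gesture masks.
    """
    half = kernel_size // 2
    offsets = [(dy * rate, dx * rate)
               for dy in range(-half, half + 1)
               for dx in range(-half, half + 1)]
    src = mask.astype(bool)
    h, w = src.shape
    out = np.zeros_like(src)
    for dy, dx in offsets:
        # Shift the mask by (dy, dx) and OR it into the result.
        shifted = np.zeros_like(src)
        dst_y = slice(max(dy, 0), h + min(dy, 0))
        dst_x = slice(max(dx, 0), w + min(dx, 0))
        src_y = slice(max(-dy, 0), h + min(-dy, 0))
        src_x = slice(max(-dx, 0), w + min(-dx, 0))
        shifted[dst_y, dst_x] = src[src_y, src_x]
        out |= shifted
    return out.astype(np.uint8)

# Toy 9x9 "hand mask" with a single foreground pixel at the centre.
mask = np.zeros((9, 9), dtype=np.uint8)
mask[4, 4] = 1
grown = dilate_mask(mask, kernel_size=3, rate=2)
```

With `rate=2` the single centre pixel expands to nine pixels on a grid spaced two apart, showing how the atrous spacing enlarges the mask's footprint without a larger kernel.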

Keywords