GestureVLAD: Combining Unsupervised Features Representation and Spatio-Temporal Aggregation for Doppler-Radar Gesture Recognition

Abel Diaz Berenguer; Meshia Cedric Oveneke; Habib-Ur-Rehman Khalid; Mitchel Alioscha-Perez; Andre Bourdoux; Hichem Sahli

doi:10.1109/ACCESS.2019.2942305

IEEE Access (Jan 2019)

GestureVLAD: Combining Unsupervised Features Representation and Spatio-Temporal Aggregation for Doppler-Radar Gesture Recognition

Abel Diaz Berenguer,
Meshia Cedric Oveneke,
Habib-Ur-Rehman Khalid,
Mitchel Alioscha-Perez,
Andre Bourdoux,
Hichem Sahli

Affiliations

Abel Diaz Berenguer: ORCiD; Department of Electronics and Informatics (ETRO), VUP-NPU Joint Audio-Visual Signal Processing (AVSP) Research Laboratory, Vrije Universiteit Brussel (VUB), Brussels, Belgium
Meshia Cedric Oveneke: Department of Electronics and Informatics (ETRO), VUP-NPU Joint Audio-Visual Signal Processing (AVSP) Research Laboratory, Vrije Universiteit Brussel (VUB), Brussels, Belgium
Habib-Ur-Rehman Khalid: Department of Electronics and Informatics (ETRO), VUP-NPU Joint Audio-Visual Signal Processing (AVSP) Research Laboratory, Vrije Universiteit Brussel (VUB), Brussels, Belgium
Mitchel Alioscha-Perez: Department of Electronics and Informatics (ETRO), VUP-NPU Joint Audio-Visual Signal Processing (AVSP) Research Laboratory, Vrije Universiteit Brussel (VUB), Brussels, Belgium
Andre Bourdoux: Interuniversity Microelectronics Centre (IMEC), Heverlee, Belgium
Hichem Sahli: ORCiD; Department of Electronics and Informatics (ETRO), VUP-NPU Joint Audio-Visual Signal Processing (AVSP) Research Laboratory, Vrije Universiteit Brussel (VUB), Brussels, Belgium

DOI: https://doi.org/10.1109/ACCESS.2019.2942305
Journal volume & issue: Vol. 7
pp. 137122 – 137135

Abstract

Read online

In this paper we propose a novel framework to process Doppler-radar signals for hand gesture recognition. Doppler-radar sensors provide many advantages over other emerging sensing modalities, including low development costs and high sensitivity to capture subtle gestures with precision. Furthermore, they have attractive properties for ubiquitous deployment and can be conveniently embedded into different devices. In this scope, current recognition methods still rely in deep CNN-LSTM and 3D CNN-LSTM structures that require sufficient labelled data to optimize millions of parameters and significant amount of computational resources for inference; which limits their deployment. Indeed, subtle gestures recognition is a challenging task due to the high variability of gestures among different subjects. To overcome the challenges in the recognition task and the limitations of the current methods, we propose a shallow learning approach for gesture recognition, that is based on unsupervised range-Doppler features representation, along with a learnable pooling aggregation via NetVLAD. The proposed framework can encode extremely valuable information across time, and results in features that are highly discriminative for hand gesture recognition. Experimentation on publicly available Doppler-radar data shows that the proposed framework outperforms state-of-the-art approaches in terms of recognition accuracy and speed for sequence-level hand gesture classification.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords