Improved Action Recognition with Separable Spatio-Temporal Attention Using Alternative Skeletal and Video Pre-Processing

Pau Climent-Pérez; Francisco Florez-Revuelta

doi:10.3390/s21031005

Sensors (Feb 2021)

Improved Action Recognition with Separable Spatio-Temporal Attention Using Alternative Skeletal and Video Pre-Processing

Pau Climent-Pérez,
Francisco Florez-Revuelta

Affiliations

Pau Climent-Pérez: Department of Computing Technology, University of Alicante, P.O. Box 99, E-03080 Alicante, Spain
Francisco Florez-Revuelta: Department of Computing Technology, University of Alicante, P.O. Box 99, E-03080 Alicante, Spain

DOI: https://doi.org/10.3390/s21031005
Journal volume & issue: Vol. 21, no. 3
p. 1005

Abstract

Read online

The potential benefits of recognising activities of daily living from video for active and assisted living have yet to be fully untapped. These technologies can be used for behaviour understanding, and lifelogging for caregivers and end users alike. The recent publication of realistic datasets for this purpose, such as the Toyota Smarthomes dataset, calls for pushing forward the efforts to improve action recognition. Using the separable spatio-temporal attention network proposed in the literature, this paper introduces a view-invariant normalisation of skeletal pose data and full activity crops for RGB data, which improve the baseline results by 9.5% (on the cross-subject experiments), outperforming state-of-the-art techniques in this field when using the original unmodified skeletal data in dataset. Our code and data are available online.

Published in Sensors

ISSN: 1424-8220 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Chemical technology
Website: http://www.mdpi.com/journal/sensors

About the journal

Abstract

Keywords