Improving Human Activity Recognition Integrating LSTM With Different Data Sources: Features, Object Detection and Skeleton Tracking

Jaime Duque Domingo; Jaime Gomez-Garcia-Bermejo; Eduardo Zalama

doi:10.1109/ACCESS.2022.3186465

IEEE Access (Jan 2022)

Improving Human Activity Recognition Integrating LSTM With Different Data Sources: Features, Object Detection and Skeleton Tracking

Jaime Duque Domingo,
Jaime Gomez-Garcia-Bermejo,
Eduardo Zalama

Affiliations

Jaime Duque Domingo: ORCiD; CARTIF Foundation, División de Sistemas Industriales y Digitales, Parque Tecnológico de Boecillo, Valladolid, Spain
Jaime Gomez-Garcia-Bermejo: ORCiD; CARTIF Foundation, División de Sistemas Industriales y Digitales, Parque Tecnológico de Boecillo, Valladolid, Spain
Eduardo Zalama: CARTIF Foundation, División de Sistemas Industriales y Digitales, Parque Tecnológico de Boecillo, Valladolid, Spain

DOI: https://doi.org/10.1109/ACCESS.2022.3186465
Journal volume & issue: Vol. 10
pp. 68213 – 68230

Abstract

Read online

Over the past few years, technologies in the field of computer vision have greatly advanced. The use of deep neural networks, together with the development of computing capabilities, has made it possible to solve problems of great interest to society. In this work, we focus on one such problem that has seen a great development, the recognition of actions in live videos. Although the problem has been oriented in different ways in the literature, we have focused on indoor residential environments, such as a house or a nursing home. Our system can be used to understand what actions a person or group of people are carrying out. Two of the approaches used to solve the problem have been 3D convolution networks and recurrent networks. In our case, we have created a model that accurately combines several recurrent networks with processed data from different techniques: image feature extraction, object detection and people’s skeletons. The need to integrate these three techniques arises from the search to improve the detection of certain actions by taking advantage of the best recognition offered by each of the methods. In a complete experimentation, where several techniques have been evaluated against different datasets, the classification of the actions has been improved with respect to the existing models.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords