A Two-Stream Method for Human Action Recognition Using Facial Action Cues

Zhimao Lai; Yan Zhang; Xiubo Liang

doi:10.3390/s24216817

Sensors (Oct 2024)

Affiliations

Zhimao Lai: School of Immigration Administration (Guangzhou), China People’s Police University, Guangzhou 510663, China
Yan Zhang: School of Immigration Administration, China People’s Police University, Langfang 065000, China
Xiubo Liang: School of Immigration Administration, China People’s Police University, Langfang 065000, China

DOI: https://doi.org/10.3390/s24216817
Journal volume & issue: Vol. 24, no. 21, p. 6817

Abstract

Human action recognition (HAR) is a critical area in computer vision with wide-ranging applications, including video surveillance, healthcare monitoring, and abnormal behavior detection. Current HAR methods predominantly rely on full-body data, which can limit their effectiveness in real-world scenarios where occlusion is common. In such situations, the face often remains visible, providing valuable cues for action recognition. This paper introduces Face in Action (FIA), a novel two-stream method that leverages facial action cues for robust action recognition under conditions of significant occlusion. FIA consists of an RGB stream and a landmark stream. The RGB stream processes facial image sequences using a fine-spatio-multitemporal (FSM) 3D convolution module, which employs smaller spatial receptive fields to capture detailed local facial movements and larger temporal receptive fields to model broader temporal dynamics. The landmark stream processes facial landmark sequences using a normalized temporal attention (NTA) module within an NTA-GCN block, enhancing the detection of key facial frames and improving overall recognition accuracy. We validate the effectiveness of FIA using the NTU RGB+D and NTU RGB+D 120 datasets, focusing on action categories related to medical conditions. Our experiments demonstrate that FIA significantly outperforms existing methods in scenarios with extensive occlusion, highlighting its potential for practical applications in surveillance and healthcare settings.
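The two ideas named in the abstract can be illustrated with a small, generic sketch. It is not the authors' FSM or NTA implementation: the kernel sizes, the function names `receptive_field` and `temporal_attention_pool`, and the use of a plain softmax for normalization are all assumptions made for illustration. The first function shows how stacked convolution kernels determine the receptive field along one axis, so that small spatial kernels and larger temporal kernels yield a narrow spatial view and a wide temporal view. The second shows one common way to normalize per-frame scores into attention weights so that key frames contribute more to the pooled feature.

```python
import math


def receptive_field(kernel_sizes, strides=None):
    """Receptive field along one axis for a stack of convolution layers.

    Standard recurrence: each layer adds (kernel - 1) * jump, where jump is
    the product of the strides of all preceding layers.
    """
    strides = strides or [1] * len(kernel_sizes)
    rf, jump = 1, 1
    for k, s in zip(kernel_sizes, strides):
        rf += (k - 1) * jump
        jump *= s
    return rf


def temporal_attention_pool(frame_features, frame_scores):
    """Softmax-normalize per-frame scores and pool the frame features.

    frame_features: list of T feature vectors (one per frame).
    frame_scores:   list of T scalar relevance scores (assumed to come from
                    some learned scoring layer, which is not modeled here).
    Returns (weights, pooled), where weights sum to 1.
    """
    m = max(frame_scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in frame_scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    dim = len(frame_features[0])
    pooled = [
        sum(w * f[d] for w, f in zip(weights, frame_features))
        for d in range(dim)
    ]
    return weights, pooled


# Hypothetical kernel choices, not taken from the paper:
spatial_rf = receptive_field([3, 3])   # two small spatial kernels
temporal_rf = receptive_field([3, 7])  # a small and a larger temporal kernel

# Three frames with 2-D features; the second frame has the highest score.
weights, pooled = temporal_attention_pool(
    [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]],
    [0.0, 2.0, 0.0],
)
```

With these assumed kernels, the spatial receptive field spans 5 positions while the temporal one spans 9 frames, which mirrors the "fine spatial, broad temporal" design the abstract describes. In the attention example, the second frame receives the largest weight and so dominates the pooled feature.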

Published in Sensors

ISSN: 1424-8220 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Chemical technology
Website: http://www.mdpi.com/journal/sensors
