Feature Fusion for Dual-Stream Cooperative Action Recognition

Dong Chen; Mengtao Wu; Tao Zhang; Chuanqi Li

doi:10.1109/access.2023.3325401

IEEE Access (Jan 2023)

Feature Fusion for Dual-Stream Cooperative Action Recognition

Dong Chen,
Mengtao Wu,
Tao Zhang,
Chuanqi Li

Affiliations

Dong Chen: ORCiD; College of Computer Science and Engineering, Guangxi Normal University, Guilin, China
Mengtao Wu: ORCiD; College of Physics and Electronic Engineering, Nanning Normal University, Nanning, China
Tao Zhang: College of Physics and Electronic Engineering, Nanning Normal University, Nanning, China
Chuanqi Li: College of Computer Science and Engineering, Guangxi Normal University, Guilin, China

DOI: https://doi.org/10.1109/access.2023.3325401
Journal volume & issue: Vol. 11
pp. 116732 – 116740

Abstract

Read online

Currently, the primary methods for action recognition involve RGB-based approaches, pose-based approaches (e.g., skeleton coordinates), and multi-stream fusion methods. In this paper, we propose a novel action recognition framework based on both RGB images and motion pose images to enhance the accuracy of action recognition in videos. As a single feature representation fail to effectively capture motion trends and image variation information, it cannot accurately reflect expected action judgments in real-world scenarios. Therefore, we utilize the appearance features of video frames and the motion variation features of the subject, aiming to cooperate the action itself with appearance information for precise action recognition. We construct video representations based on local spatiotemporal features and global features, and utilize the ResNet backbone network and Temporal Shift Module (TSM) to extract action representations from multi-stream information. Driven by the motion features, the fusion of multi-stream information achieves effective expression of motion features. Experimental results on public datasets demonstrate the effectiveness of our proposed method. It achieves competitive performance compared to state-of-the-art techniques while maintaining a less complex and more interpretable model. Overall, our approach demonstrates superior effectiveness.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords