Integrating Vision Transformer-Based Bilinear Pooling and Attention Network Fusion of RGB and Skeleton Features for Human Action Recognition

Yaohui Sun; Weiyao Xu; Xiaoyi Yu; Ju Gao; Ting Xia

doi:10.1007/s44196-023-00292-9

International Journal of Computational Intelligence Systems (Jul 2023)

Affiliations

Yaohui Sun: School of Automation Engineering, University of Electronic Science and Technology of China
Weiyao Xu: School of Opto-Electronic Engineering, Zaozhuang University
Xiaoyi Yu: School of Opto-Electronic Engineering, Zaozhuang University
Ju Gao: School of Opto-Electronic Engineering, Zaozhuang University
Ting Xia: School of Opto-Electronic Engineering, Zaozhuang University

DOI: https://doi.org/10.1007/s44196-023-00292-9
Journal volume & issue: Vol. 16, no. 1, pp. 1–11

Abstract

In this paper, we propose VT-BPAN, a novel approach that combines Vision Transformer (VT), bilinear pooling, and attention network fusion for human action recognition (HAR). The method improves recognition accuracy through two contributions: (1) a two-stream feature pooling and fusion mechanism that combines RGB frames and skeleton data to strengthen the spatial–temporal feature representation; (2) a lightweight spatial vision transformer that reduces computational cost. Evaluated on three widely used video action datasets, the proposed approach performs on par with state-of-the-art methods.
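
The abstract names bilinear pooling and attention-based fusion of an RGB stream and a skeleton stream but gives no formulas. The sketch below shows one common form of each idea in NumPy. It is only an illustration under stated assumptions, not the VT-BPAN architecture: the function names `bilinear_pool` and `attention_fuse`, the signed square root and L2 normalization steps, and the softmax weighting are all assumptions that do not come from the paper.

```python
import numpy as np


def bilinear_pool(rgb_feat, skel_feat):
    """Outer-product bilinear pooling of two 1-D feature vectors.

    Assumed post-processing (not from the paper): signed square root
    followed by L2 normalization.
    """
    # Every pairwise product between the two streams, flattened to 1-D.
    outer = np.outer(rgb_feat, skel_feat).ravel()
    # Signed square root dampens large activations while keeping the sign.
    signed_sqrt = np.sign(outer) * np.sqrt(np.abs(outer))
    norm = np.linalg.norm(signed_sqrt)
    return signed_sqrt / norm if norm > 0 else signed_sqrt


def attention_fuse(streams, scores):
    """Softmax-weighted sum of equal-length stream features.

    `scores` are hypothetical per-stream relevance scores; in a trained
    model they would be produced by a learned attention layer.
    """
    scores = np.asarray(scores, dtype=float)
    # Numerically stable softmax over the stream scores.
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()
    # Weight each stream and sum across streams.
    fused = (weights[:, None] * np.stack(streams)).sum(axis=0)
    return fused, weights


if __name__ == "__main__":
    rgb = np.array([1.0, 2.0])          # toy RGB-stream feature
    skel = np.array([3.0, 4.0, 5.0])    # toy skeleton-stream feature
    pooled = bilinear_pool(rgb, skel)
    print(pooled.shape)                 # one entry per (rgb, skeleton) pair
    fused, w = attention_fuse([np.ones(3), np.zeros(3)], [0.0, 0.0])
    print(fused, w)
```

The outer product makes every RGB feature interact with every skeleton feature, so the pooled vector has length equal to the product of the two input lengths. That growth is why practical systems often use compact or factorized variants of bilinear pooling.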

Published in International Journal of Computational Intelligence Systems

ISSN: 1875-6891 (Print); 1875-6883 (Online)
Publisher: Springer
Country of publisher: Switzerland
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.springer.com/journal/44196
