Exploiting spatio‐temporal knowledge for video action recognition

Huigang Zhang; Liuan Wang; Jun Sun

doi:10.1049/cvi2.12154

IET Computer Vision (Mar 2023)

Exploiting spatio‐temporal knowledge for video action recognition

Huigang Zhang,
Liuan Wang,
Jun Sun

Affiliations

Huigang Zhang: Fujitsu R&D Center Beijing China
Liuan Wang: Fujitsu R&D Center Beijing China
Jun Sun: Fujitsu R&D Center Beijing China

DOI: https://doi.org/10.1049/cvi2.12154
Journal volume & issue: Vol. 17, no. 2
pp. 222 – 230

Abstract

Read online

Abstract Action recognition has been a popular area of computer vision research in recent years. The goal of this task is to recognise human actions in video frames. Most existing methods often depend on the visual features and their relationships inside the videos. The extracted features only represent the visual information of the current video itself and cannot represent the general knowledge of particular actions beyond the video. Thus, there are some deviations in these features, and the recognition performance still requires improvement. In this sudy, we present a novel spatio‐temporal knowledge module (STKM) to endow the current methods with commonsense knowledge. To this end, we first collect hybrid external knowledge from universal fields, which contains both visual and semantic information. Then graph convolution networks (GCN) are used to represent and aggregate this knowledge. The GCNs involve (i) a spatial graph to capture spatial relations and (ii) a temporal graph to capture serial occurrence relations among actions. By integrating knowledge and visual features, we can get better recognition results. Experiments on AVA, UCF101‐24 and JHMDB datasets show the robustness and generalisation ability of STKM. The results report a new state‐of‐the‐art 32.0 mAP on AVA v2.1. On UCF101‐24 and JHMDB datasets, our method also improves by 1.5 AP and 2.6 AP, respectively, over the baseline method.

Published in IET Computer Vision

ISSN: 1751-9632 (Print); 1751-9640 (Online)
Publisher: Wiley
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Science: Mathematics: Instruments and machines: Electronic computers. Computer science: Computer software
Website: https://ietresearch.onlinelibrary.wiley.com/journal/17519640

About the journal

Abstract

Keywords