A deep learning method for video‐based action recognition

Guanwen Zhang; Yukun Rao; Changhao Wang; Wei Zhou; Xiangyang Ji

doi:10.1049/ipr2.12303

IET Image Processing (Dec 2021)

A deep learning method for video‐based action recognition

Guanwen Zhang,
Yukun Rao,
Changhao Wang,
Wei Zhou,
Xiangyang Ji

Affiliations

Guanwen Zhang: School of Electronics and Information Northwestern Polytechnical University Xi'an China
Yukun Rao: School of Electronics and Information Northwestern Polytechnical University Xi'an China
Changhao Wang: School of Electronics and Information Northwestern Polytechnical University Xi'an China
Wei Zhou: School of Electronics and Information Northwestern Polytechnical University Xi'an China
Xiangyang Ji: Department of Automation Tsinghua University Beijing China

DOI: https://doi.org/10.1049/ipr2.12303
Journal volume & issue: Vol. 15, no. 14
pp. 3498 – 3511

Abstract

Read online

Abstract In this paper, a deep learning method for video‐based action recognition is proposed. On the one hand, boundary compensation on the basis of a deep neural network is performed to achieve action proposal. Boundary compensation considering non‐maximum suppression according to sliding window priority is applied to remove redundant windows. To accurately detect boundaries, a boundary compensation network is established with multiple networks to process different numbers of segments. On the other hand, action recognition based on the resultant action proposals is performed. To further utilise boundary compensation, three methods are introduced for key frame selection. Optical flow and RGB features are combined via a channel fusion to realise feature representation. A two‐stream network with a spatiotemporal structure is adopted for action recognition. The proposed method is evaluated on three public datasets. The experimental results demonstrate that the proposed method achieves a superior performance to that of state‐of‐the‐art methods.

Published in IET Image Processing

ISSN: 1751-9659 (Print); 1751-9667 (Online)
Publisher: Wiley
Country of publisher: United Kingdom
LCC subjects: Technology: Photography; Science: Mathematics: Instruments and machines: Electronic computers. Computer science: Computer software
Website: https://ietresearch.onlinelibrary.wiley.com/journal/17519667

About the journal

Abstract

Keywords