A Multimodal Pairwise Discrimination Network for Cross-Domain Action Recognition

Fuhua Shang; Tao Tao Han; Feng Tian; Jun Wei Tao; Zan Gao

doi:10.1109/ACCESS.2020.3014691

IEEE Access (Jan 2020)

A Multimodal Pairwise Discrimination Network for Cross-Domain Action Recognition

Fuhua Shang,
Tao Tao Han,
Feng Tian,
Jun Wei Tao,
Zan Gao

Affiliations

Fuhua Shang: School of Computer and Information Technology, Northeast Petroleum University, Daqing, China
Tao Tao Han: Key Laboratory of Computer Vision and System, Ministry of Education, Tianjin University of Technology, Tianjin, China
Feng Tian: School of Computer and Information Technology, Northeast Petroleum University, Daqing, China
Jun Wei Tao: IRay Technology Company Ltd., Yantai, China
Zan Gao: ORCiD; Shandong AI Institute, Qilu University of Technology (Shandong Academy of Sciences), Jinan, China

DOI: https://doi.org/10.1109/ACCESS.2020.3014691
Journal volume & issue: Vol. 8
pp. 143545 – 143557

Abstract

Read online

In recent years, action recognition has become a hot research topic in the computer vision and machine learning domain. Despite many well-designed action recognition approaches have been proposed, we point out that some limitations still exist including the separated fusion of different Spatio-temporal features and the reconstruction classification model, and the requirement of similar environmental conditions when capturing the training and testing data. Thus, research interest has shifted from traditional action recognition towards cross-domain action recognition. To solve these limitations, in this work, we propose a novel multimodal pairwise discrimination network (short for MPD) for cross-domain action recognition that is an end-to-end network architecture. In MPD, it can jointly fuse different Spatio-temporal features from the video, learn domain invariant features for different action domains (source and target domains), and build the classification model. To characterize the shift between these domains, subnetwork parameters in corresponding layers of MPD are required to be relevant, but not identical. Besides, the domain invariant feature discrimination needs to be improved. Extensive experimental results on two different public benchmarks including indoor environment and outdoor environment demonstrate that our MPD solution can significantly outperform state-of-the-art methods with a 4% to 20% improvement in average accuracy.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords