Probability‐based method for boosting human action recognition using scene context

Hong‐Bo Zhang; Qing Lei; Duan‐Sheng Chen; Bi‐Neng Zhong; Jialin Peng; Ji‐Xiang Du; Song‐Zhi Su

doi:10.1049/iet-cvi.2015.0420

IET Computer Vision (Sep 2016)

Probability‐based method for boosting human action recognition using scene context

Hong‐Bo Zhang,
Qing Lei,
Duan‐Sheng Chen,
Bi‐Neng Zhong,
Jialin Peng,
Ji‐Xiang Du,
Song‐Zhi Su

Affiliations

Hong‐Bo Zhang: Department of Computer Science and TechnologyHuaqiao UniversityFujianPeople's Republic of China
Qing Lei: Department of Computer Science and TechnologyHuaqiao UniversityFujianPeople's Republic of China
Duan‐Sheng Chen: Department of Computer Science and TechnologyHuaqiao UniversityFujianPeople's Republic of China
Bi‐Neng Zhong: Department of Computer Science and TechnologyHuaqiao UniversityFujianPeople's Republic of China
Jialin Peng: Department of Computer Science and TechnologyHuaqiao UniversityFujianPeople's Republic of China
Ji‐Xiang Du: Department of Computer Science and TechnologyHuaqiao UniversityFujianPeople's Republic of China
Song‐Zhi Su: Department of Information Science and TechnologyXiamen UniversityFujianPeople's Republic of China

DOI: https://doi.org/10.1049/iet-cvi.2015.0420
Journal volume & issue: Vol. 10, no. 6
pp. 528 – 536

Abstract

Read online

In this study, the authors investigate the possibility of boosting action recognition performance by exploiting the associated scene context. Towards this end, the authors model a scene as a mid‐level ‘middle layer’ in order to bridge action descriptors and action categories. This is achieved via a scene topic model, in which hybrid visual descriptors, including spatial–temporal action features and scene descriptors, are first extracted from a video sequence. Then, the authors learn a joint probability distribution between scene and action using a naive Bayes nearest neighbour algorithm, which is adopted to jointly infer the action categories online by combining off‐the‐shelf action recognition algorithms. The authors demonstrate the advantages of their approach by comparing it with state‐of‐the‐art approaches using several action recognition benchmarks.

Published in IET Computer Vision

ISSN: 1751-9632 (Print); 1751-9640 (Online)
Publisher: Wiley
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Science: Mathematics: Instruments and machines: Electronic computers. Computer science: Computer software
Website: https://ietresearch.onlinelibrary.wiley.com/journal/17519640

About the journal

Abstract

Keywords