ETRI Journal (Aug 2017)

Extensible Hierarchical Method of Detecting Interactive Actions for Video Understanding

  • Jinyoung Moon,
  • Junho Jin,
  • Yongjin Kwon,
  • Kyuchang Kang,
  • Jongyoul Park,
  • Kyoung Park

DOI
https://doi.org/10.4218/etrij.17.0116.0054
Journal volume & issue
Vol. 39, no. 4
pp. 502 – 513

Abstract

Read online

For video understanding, namely analyzing who did what in a video, actions along with objects are primary elements. Most studies on actions have handled recognition problems for a well‐trimmed video and focused on enhancing their classification performance. However, action detection, including localization as well as recognition, is required because, in general, actions intersect in time and space. In addition, most studies have not considered extensibility for a newly added action that has been previously trained. Therefore, proposed in this paper is an extensible hierarchical method for detecting generic actions, which combine object movements and spatial relations between two objects, and inherited actions, which are determined by the related objects through an ontology and rule based methodology. The hierarchical design of the method enables it to detect any interactive actions based on the spatial relations between two objects. The method using object information achieves an F‐measure of 90.27%. Moreover, this paper describes the extensibility of the method for a new action contained in a video from a video domain that is different from the dataset used.

Keywords