IET Computer Vision (Mar 2016)

Weighted averaging fusion for multi‐view skeletal data and its application in action recognition

  • Nur Aziza Azis,
  • Young‐Seob Jeong,
  • Ho‐Jin Choi,
  • Youssef Iraqi

DOI
https://doi.org/10.1049/iet-cvi.2015.0146
Journal volume & issue
Vol. 10, no. 2
pp. 134 – 142

Abstract

Read online

Existing studies in skeleton‐based action recognition mainly utilise skeletal data taken from a single camera. Since the quality of skeletal tracking of a single camera is noisy and unreliable, however, combining data from multiple cameras can improve the tracking quality and hence increase the recognition accuracy. In this study, the authors propose a method called weighted averaging fusion which merges skeletal data of two or more camera views. The method first evaluates the reliability of a set of corresponding joints based on their distances to the centroid, then computes the weighted average of selected joints, that is, each joint is weighted by the overall reliability of the camera reporting the joint. Such obtained, fused skeletal data are used as the input to the action recognition step. Experiments using various frame‐level features and testing schemes show that more than 10% improvement can be achieved in the action recognition accuracy using these fused skeletal data as compared with the single‐view case.

Keywords