Applied AI Letters (Dec 2021)

Explainable activity recognition in videos: Lessons learned

  • Chiradeep Roy,
  • Mahsan Nourani,
  • Donald R. Honeycutt,
  • Jeremy E. Block,
  • Tahrima Rahman,
  • Eric D. Ragan,
  • Nicholas Ruozzi,
  • Vibhav Gogate

DOI
https://doi.org/10.1002/ail2.59
Journal volume & issue
Vol. 2, no. 4
pp. n/a – n/a

Abstract

Read online

Abstract We consider the following activity recognition task: given a video, infer the set of activities being performed in the video and assign each frame to an activity. This task can be solved using modern deep learning architectures based on neural networks or conventional classifiers such as linear models and decision trees. While neural networks exhibit superior predictive performance as compared with decision trees and linear models, they are also uninterpretable and less explainable. We address this accuracy‐explanability gap using a novel framework that feeds the output of a deep neural network to an interpretable, tractable probabilistic model called dynamic cutset networks, and performs joint reasoning over the two to answer questions. The neural network helps achieve high accuracy while dynamic cutset networks because of their polytime probabilistic reasoning capabilities make the system more explainable. We demonstrate the efficacy of our approach by using it to build three prototype systems that solve human‐machine tasks having varying levels of difficulty using cooking videos as an accessible domain. We describe high‐level technical details and key lessons learned in our human subjects evaluations of these systems.

Keywords