IEEE Access (Jan 2021)

Closed-Loop Region of Interest Enabling High Spatial and Temporal Resolutions in Object Detection and Tracking via Wireless Camera

  • Jack Chen,
  • Hen-Wei Huang,
  • Philipp Rupp,
  • Anjali Sinha,
  • Claas Ehmke,
  • Giovanni Traverso

DOI
https://doi.org/10.1109/ACCESS.2021.3086499
Journal volume & issue
Vol. 9
pp. 87340 – 87350

Abstract

Read online

The trade-off between spatial and temporal resolution remains a fundamental challenge in machine vision. A captured image often contains a significant amount of redundant information, and only a small region of interest (ROI) is necessary for object detection and tracking. In this paper, we first systematically characterize the effects of ROI on camera capturing, data transmission, and image processing. We then present the closed-loop ROI algorithm capable of high spatial and temporal resolution as well as wide scanning field of view (FOV) in single and multi-object detection and tracking via real-time wireless video streaming. With the feedback from real-time object tracking, the wireless camera is able to capture and transmit only the ROI which in turn enhances both the spatial and temporal resolution in object tracking. In addition, the proposed approach can still maintain a large FOV by processing regions outside of the ROI at lower spatial and temporal resolutions. When applied to a high spatial resolution wireless stream (5 MegaPixels), the closed-loop ROI algorithm improves the temporal resolution by up to $10\times $ (from 2.4FPS to 22.5FPS). Specifically, camera processing is improved by up to $4.7\times $ , data transmission is improved by up to $160\times $ , and PC processing is improved by up to $2.5\times $ . In a person tracking experiment, the closed-loop ROI algorithm enables a wide-angle camera to outperform both a normal wide-angle camera–which suffers from poor temporal resolution and motion blur–and a pan & tilt camera–which cannot automatically refresh tracking after the tracking is lost.

Keywords