IEEE Open Journal of Instrumentation and Measurement (Jan 2025)

Hydra-Mask-RCNN: An Adaptive HydraNet Architecture for Autonomous Aerial Vehicle Object Detection

  • Sara Naseri Golestani,
  • Mahdi SadeghiBakhi,
  • King Fai Ma,
  • Henry Leung

DOI
https://doi.org/10.1109/OJIM.2024.3502886
Journal volume & issue
Vol. 4
pp. 1 – 12

Abstract

Read online

Environmental monitoring is essential for understanding and mitigating the impact of human activities on the planet, as well as for developing effective strategies for sustainable development and conservation. Accurate object detection in aerial images is crucial for environmental monitoring and surveillance using autonomous aerial vehicles (AAVs). However, existing methods, including Mask R-CNN (MRCNN) and you-only-look-once (YOLO), struggle to detect small- and medium-sized objects from AAV sensors, limiting their usability for AAV surveillance. We propose Hydra-MRCNN (HMRCNN), a multitask learning network that enhances detection precision for small- and medium-sized objects in aerial images. By integrating an adaptive branching network (ABN) with HydraNet, HMRCNN improves feature extraction and object detection capabilities. Evaluations on Microsoft Common Objects in Context (MS-COCO), Aerial-Cars, VisDrone, and Plastic in River datasets show significant improvements in average recall (AR) compared to baseline models, including MRCNN and YOLO. Our approach has important implications for environmental monitoring, enabling more accurate detection of objects relevant to transportation, security, traffic, pollution, and infrastructure management. With the growing use of AAVs in environmental surveillance, HMRCNN offers a valuable tool for enhancing environmental measurement and assessment capabilities. Our method improves detection performance by over 6% on AAV datasets, making it a valuable contribution to the field as the commercial AAV market is expected to grow from 25 billion to 50 billion in the next decade.

Keywords