Automatika (Apr 2024)

Video frame feeding approach for validating the performance of an object detection model in real-world conditions

  • Keerthi Jayan,
  • B. Muruganantham

DOI
https://doi.org/10.1080/00051144.2024.2314928
Journal volume & issue
Vol. 65, no. 2
pp. 627–640

Abstract


This study addresses the challenge of evaluating deep learning-based object detection models in complex traffic scenarios characterized by changing weather and lighting conditions. Because real-world testing is time- and cost-intensive, a Video Frame Feeding (VFF) approach is proposed as a solution. The VFF approach acts as a bridge between object detection models and simulated environments, enabling the generation of realistic scenarios. Leveraging the CarMaker (CM) tool to simulate these scenarios, the framework uses a virtual camera to capture the simulated environment and feed video frames to an object detection model. With automated validation against simulated ground-truth data, the VFF algorithm achieves detection accuracy above 95% at 30 frames per second within 130 metres. Using You Only Look Once (YOLO) version 4 and the German Traffic Sign Recognition Benchmark dataset, the study assesses a traffic signboard identification model across various climatic conditions. Notably, the VFF algorithm improves accuracy by 2% to 5% in challenging scenarios such as foggy days and nights. This approach not only identifies object detection issues efficiently but also offers a versatile solution applicable to any object detection model, promising improved dataset quality and robustness for enhanced model performance.
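The feed-and-validate loop the abstract describes can be sketched as follows. This is a minimal illustrative sketch, not the authors' implementation: the detector stub, frame dictionaries, and the 0.5 IoU matching threshold are all assumptions introduced here, standing in for the real YOLOv4 model and CarMaker ground-truth export.

```python
# Hypothetical sketch of a VFF-style validation loop: each simulated frame
# carries ground-truth boxes, a detector returns predicted boxes, and
# detections are scored by intersection-over-union (IoU) against the
# ground truth. All names and thresholds here are illustrative assumptions.

def iou(a, b):
    """Intersection-over-union of two axis-aligned boxes (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter) if inter else 0.0

def validate_frames(frames, detector, iou_thresh=0.5):
    """Feed each simulated frame to the detector and report the fraction of
    ground-truth boxes matched by some detection with IoU >= iou_thresh."""
    matched = total = 0
    for frame in frames:
        preds = detector(frame["image"])
        for gt in frame["ground_truth"]:
            total += 1
            if any(iou(gt, p) >= iou_thresh for p in preds):
                matched += 1
    return matched / total if total else 0.0

if __name__ == "__main__":
    # Stub detector standing in for a YOLOv4 model fed from the simulator.
    stub = lambda img: [(10, 10, 50, 50)]
    frames = [{"image": None, "ground_truth": [(12, 12, 48, 48)]}]
    print(validate_frames(frames, stub))  # prints 1.0: IoU 0.81 >= 0.5
```

In a real pipeline the `frames` sequence would stream from the virtual camera at 30 fps and the per-frame accuracy would be aggregated per scenario (day, night, fog), which is how the reported 2-5% improvements would be measured.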

Keywords