A Fusion of RGB Features and Local Descriptors for Object Detection in Road Scene

Vinh Dinh Nguyen

doi:10.1109/ACCESS.2024.3404248

IEEE Access (Jan 2024)

A Fusion of RGB Features and Local Descriptors for Object Detection in Road Scene

Vinh Dinh Nguyen

Affiliations

Vinh Dinh Nguyen: ORCiD; Department of Information Technology, FPT University, Can Tho Campus, Can Tho City, Vietnam

DOI: https://doi.org/10.1109/ACCESS.2024.3404248
Journal volume & issue: Vol. 12
pp. 72957 – 72967

Abstract

Read online

Many texture descriptors have been introduced in recent years to improve texture analysis and classification outcomes, which are important in many computer vision tasks including object recognition and detection, human detector, and especially in face recognition. Local pattern is a texture descriptor that can successfully extract distinctive texture features that possesses noise and illumination variance robustness. This paper focuses on making use of local pattern features in boosting object detection models in a multi-modal fusion paradigm to acquire reliable feature maps in forward propagation throughout the network regardless of variations in photo taking conditions. We propose an adaptive fusion architecture for RGB and Local Ternary Pattern information. This architecture leverage local pattern to enrich information of original feature maps and adapt to many object detection models. Our local pattern fusion network concentrates on backbone and neck modules with an simple and efficient operation. The notable accuracy advancement is 8.03% observed in Cascade R-CNN in KITTI Dataset. In difficult conditions, our fusion models significantly lift the original performance from 4.7% to 66.3% mAP score.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords