Boost Correlation Features with 3D-MiIoU-Based Camera-LiDAR Fusion for MODT in Autonomous Driving

Kunpeng Zhang; Yanheng Liu; Fang Mei; Jingyi Jin; Yiming Wang

doi:10.3390/rs15040874

Remote Sensing (Feb 2023)

Affiliations

Kunpeng Zhang: College of Computer Science and Technology, Jilin University, Changchun 130012, China
Yanheng Liu: College of Computer Science and Technology, Jilin University, Changchun 130012, China
Fang Mei: College of Computer Science and Technology, Jilin University, Changchun 130012, China
Jingyi Jin: College of Computer Science and Technology, Jilin University, Changchun 130012, China
Yiming Wang: College of Computer Science and Technology, Jilin University, Changchun 130012, China

DOI: https://doi.org/10.3390/rs15040874
Journal volume & issue: Vol. 15, no. 4, p. 874

Abstract

Three-dimensional (3D) object tracking is a critical task in 3D computer vision, with applications in autonomous driving, robotics, and human–computer interaction. However, how best to use multimodal information among objects to improve the accuracy of multi-object detection and tracking (MOT) remains an active focus of research. We therefore present boost correlation multi-object detection and tracking (BcMODT), a multimodal MOT framework for autonomous driving that provides more trustworthy features and correlation scores for real-time detection and tracking using both camera and LiDAR measurement data. Specifically, we propose an end-to-end deep neural network that uses 2D and 3D data for joint object detection and association. A new 3D mixed IoU (3D-MiIoU) computational module is developed to obtain a more precise geometric affinity by additionally taking into account the aspect ratio and length-to-height ratio between linked frames. In addition, a boost correlation feature (BcF) module is proposed that computes the appearance affinity of similar objects in adjacent frames directly from the similarity of their feature distance and feature direction. On the KITTI tracking benchmark, our method outperforms other methods with respect to tracking accuracy.
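
The abstract does not give formulas for either affinity, so the following is only an illustrative sketch of the two ideas it names, not the paper's 3D-MiIoU or BcF modules. It assumes axis-aligned boxes given as (cx, cy, cz, length, width, height), a CIoU-style arctan term for the ratio consistency, and a simple weighted blend of distance and cosine similarity for appearance. All function names and the weights `alpha` and `beta` are hypothetical.

```python
import math

def iou_3d_axis_aligned(a, b):
    """Plain 3D IoU for axis-aligned boxes (cx, cy, cz, l, w, h). Ignores yaw (assumption)."""
    inter = 1.0
    for i in range(3):
        lo = max(a[i] - a[i + 3] / 2, b[i] - b[i + 3] / 2)
        hi = min(a[i] + a[i + 3] / 2, b[i] + b[i + 3] / 2)
        inter *= max(0.0, hi - lo)  # zero overlap on any axis gives zero volume
    union = a[3] * a[4] * a[5] + b[3] * b[4] * b[5] - inter
    return inter / union if union > 0 else 0.0

def ratio_consistency(a, b):
    """1.0 when length/width and length/height ratios match; lower as they diverge.
    The arctan form is borrowed from CIoU as an assumption, not taken from the paper."""
    d_lw = math.atan(a[3] / a[4]) - math.atan(b[3] / b[4])
    d_lh = math.atan(a[3] / a[5]) - math.atan(b[3] / b[5])
    penalty = (4 / math.pi ** 2) * (d_lw ** 2 + d_lh ** 2) / 2
    return 1.0 - penalty

def geometric_affinity(a, b, beta=0.8):
    """Hypothetical blend of overlap and shape-ratio agreement between linked frames."""
    return beta * iou_3d_axis_aligned(a, b) + (1 - beta) * ratio_consistency(a, b)

def appearance_affinity(f_prev, f_curr, alpha=0.5):
    """Hypothetical blend of feature-distance and feature-direction similarity."""
    # Distance term: Euclidean distance mapped into (0, 1]; identical features give 1.
    dist_sim = 1.0 / (1.0 + math.dist(f_prev, f_curr))
    # Direction term: cosine similarity rescaled from [-1, 1] to [0, 1].
    norm = math.hypot(*f_prev) * math.hypot(*f_curr)
    cos = sum(p * c for p, c in zip(f_prev, f_curr)) / norm if norm > 0 else 0.0
    return alpha * dist_sim + (1 - alpha) * 0.5 * (cos + 1.0)
```

In a tracker of this kind, scores such as these would typically fill a detection-to-track affinity matrix that is then passed to an assignment step; how BcMODT combines them is described in the full paper rather than in the abstract.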

Published in Remote Sensing

ISSN: 2072-4292 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science
Website: http://www.mdpi.com/journal/remotesensing/
