VKP-P3D: Real-Time Monocular Pseudo 3D Object Detection Based on Visible Key Points and Camera Geometry

Changliang Sun; Hongli Liu; Weichu Xiao; Bo Shi; Yuan Qiu

doi:10.1109/ACCESS.2024.3378105

IEEE Access (Jan 2024)

VKP-P3D: Real-Time Monocular Pseudo 3D Object Detection Based on Visible Key Points and Camera Geometry

Changliang Sun,
Hongli Liu,
Weichu Xiao,
Bo Shi,
Yuan Qiu

Affiliations

Changliang Sun: ORCiD; College of Electrical and Information Engineering, Hunan University, Changsha, China
Hongli Liu: ORCiD; College of Electrical and Information Engineering, Hunan University, Changsha, China
Weichu Xiao: College of Electrical and Information Engineering, Hunan University, Changsha, China
Bo Shi: College of Electrical and Information Engineering, Hunan University, Changsha, China
Yuan Qiu: College of Electrical and Information Engineering, Hunan University, Changsha, China

DOI: https://doi.org/10.1109/ACCESS.2024.3378105
Journal volume & issue: Vol. 12
pp. 41883 – 41895

Abstract

Read online

Three-dimensional object detection has been substantially improved with the use of expensive LiDAR and stereo vision systems in intelligent driving. The less-expensive and more scalable solution of monocular 3D object detection, however, remains a key challenge. This study primarily explores real-time pseudo 3D object detection with monocular vision and designs a single-shot RPN model, VKP-P3D, which relies purely on visual feature extraction. Through a multiscale feature fusion and an attention mechanism module, this model obtains high-dimensional feature representations during the feature extraction phase. In the detection head of the VKP-P3D model, the pseudo 3D object detection is obtained by regressing 2D bounding box and the visible key points within the image coordinate of the 3D box from the camera’s perspective. Finally, assuming flat ground and considering geometric parameters of the camera, the object’s 3D information can be extracted. To verify the effectiveness of the proposed algorithm, we constructed two pseudo 3D object detection datasets based on visible key points and compared with current state-of-the-art real-time object detector. Results showed that the proposed model has high detection accuracy and speed.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords