Mask-SL RCNN: Feature-Enhanced 3D Object Detection Network for Point Clouds

Yuanhong Zhong; Guangxia Yang; Dihang Deng; Panliang Tang; Fan Ren

doi:10.1109/JPHOT.2023.3320186

IEEE Photonics Journal (Jan 2023)

Mask-SL RCNN: Feature-Enhanced 3D Object Detection Network for Point Clouds

Yuanhong Zhong,
Guangxia Yang,
Dihang Deng,
Panliang Tang,
Fan Ren

Affiliations

Yuanhong Zhong: ORCiD; School of Microelectronics and Communication Engineering, Chongqing University, Chongqing, China
Guangxia Yang: ORCiD; School of Microelectronics and Communication Engineering, Chongqing University, Chongqing, China
Dihang Deng: ORCiD; School of Microelectronics and Communication Engineering, Chongqing University, Chongqing, China
Panliang Tang: ORCiD; China Electronics Technology Group Corporation Research Institute, Chongqing, China
Fan Ren: ORCiD; Changan Software Technology Company, Changan Automobile Corp, Chongqing, China

DOI: https://doi.org/10.1109/JPHOT.2023.3320186
Journal volume & issue: Vol. 15, no. 5
pp. 1 – 8

Abstract

Read online

At present, with the original point cloud as input, most of the object detectors use Pointnet++ to extract features of the point cloud based on the Farthest Point Sampling (FPS). However, affected by FPS, feature extraction is incomplete and unstable. Moreover, high-level semantic features lack the internal vertex properties of Regions of Interest (RoI). In order to solve the above problems, we propose the Mask-SL RCNN (Mask-Spherical-neighborhood-global-feature-Layer Region-CNN), a feature-enhanced 3D object detection network. It improves sampling of the farthest point through point-level feature enhancement. In addition, we propose Spherical neighborhood global feature Layer (SL) to supplement the global features and improve the learning ability of network. At last, based on semantic-level feature enhancement, we design grid pooling layer based on vertex attention, which fully explores the boundary characteristics of RoI and increases ability to learn advanced features in RoI. Our network improves detection precision of small objects such as pedestrians. Compared with PointRCNN, it has improved the mAP of simple, medium, and difficult object detection in the KITTI dataset by 2.66%, 1.69%, and 0.67%, respectively.

Published in IEEE Photonics Journal

ISSN: 1943-0655 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Engineering (General). Civil engineering (General): Applied optics. Photonics; Science: Physics: Optics. Light
Website: http://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=4563994

About the journal

Abstract

Keywords