SEMANTIC SEGMENTATION OF INDOOR 3D POINT CLOUD WITH SLENET

Y. Ding; X. Zheng; H. Xiong; Y. Zhang

doi:10.5194/isprs-archives-XLII-2-W13-785-2019

The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences (Jun 2019)

SEMANTIC SEGMENTATION OF INDOOR 3D POINT CLOUD WITH SLENET

Y. Ding,
X. Zheng,
H. Xiong,
Y. Zhang

Affiliations

Y. Ding: State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, WuHan University, Hubei, Wuhan, China
X. Zheng: State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, WuHan University, Hubei, Wuhan, China
H. Xiong: State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, WuHan University, Hubei, Wuhan, China
Y. Zhang: School of Mathematics and Statistics, Wuhan University, Hubei, Wuhan, China

DOI: https://doi.org/10.5194/isprs-archives-XLII-2-W13-785-2019
Journal volume & issue: Vol. XLII-2-W13
pp. 785 – 791

Abstract

Read online

With the rapid development of new indoor sensors and acquisition techniques, the amount of indoor three dimensional (3D) point cloud models was significantly increased. However, these massive “blind” point clouds are difficult to satisfy the demand of many location-based indoor applications and GIS analysis. The robust semantic segmentation of 3D point clouds remains a challenge. In this paper, a segmentation with layout estimation network (SLENet)-based 2D–3D semantic transfer method is proposed for robust segmentation of image-based indoor 3D point clouds. Firstly, a SLENet is devised to simultaneously achieve the semantic labels and indoor spatial layout estimation from 2D images. A pixel labeling pool is then constructed to incorporate the visual graphical model to realize the efficient 2D–3D semantic transfer for 3D point clouds, which avoids the time-consuming pixel-wise label transfer and the reprojection error. Finally, a 3D-contextual refinement, which explores the extra-image consistency with 3D constraints is developed to suppress the labeling contradiction caused by multi-superpixel aggregation. The experiments were conducted on an open dataset (NYUDv2 indoor dataset) and a local dataset. In comparison with the state-of-the-art methods in terms of 2D semantic segmentation, SLENet can both learn discriminative enough features for inter-class segmentation while preserving clear boundaries for intra-class segmentation. Based on the excellence of SLENet, the final 3D semantic segmentation tested on the point cloud created from the local image dataset can reach a total accuracy of 89.97%, with the object semantics and indoor structural information both expressed.

Published in The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences

ISSN: 1682-1750 (Print); 2194-9034 (Online)
Publisher: Copernicus Publications
Country of publisher: Germany
LCC subjects: Technology: Engineering (General). Civil engineering (General): Applied optics. Photonics
Website: http://www.isprs.org/publications/archives.aspx

About the journal