Performance Analysis of Random Forest Algorithm in Automatic Building Segmentation with Limited Data

Ratri Widyastuti; Deni Suwardhi; Irwan Meilano; Andri Hernandi; Nabila S. E. Putri; Asep Yusup Saptari; Sudarman

doi:10.3390/ijgi13070235

ISPRS International Journal of Geo-Information (Jul 2024)

Performance Analysis of Random Forest Algorithm in Automatic Building Segmentation with Limited Data

Ratri Widyastuti,
Deni Suwardhi,
Irwan Meilano,
Andri Hernandi,
Nabila S. E. Putri,
Asep Yusup Saptari,
Sudarman

Affiliations

Ratri Widyastuti: Spatial System and Cadastre Research Group, Faculty of Earth Sciences and Technology, Institut Teknologi Bandung, Bandung 40132, Indonesia
Deni Suwardhi: Spatial System and Cadastre Research Group, Faculty of Earth Sciences and Technology, Institut Teknologi Bandung, Bandung 40132, Indonesia
Irwan Meilano: Spatial System and Cadastre Research Group, Faculty of Earth Sciences and Technology, Institut Teknologi Bandung, Bandung 40132, Indonesia
Andri Hernandi: Spatial System and Cadastre Research Group, Faculty of Earth Sciences and Technology, Institut Teknologi Bandung, Bandung 40132, Indonesia
Nabila S. E. Putri: Spatial System and Cadastre Research Group, Faculty of Earth Sciences and Technology, Institut Teknologi Bandung, Bandung 40132, Indonesia
Asep Yusup Saptari: Spatial System and Cadastre Research Group, Faculty of Earth Sciences and Technology, Institut Teknologi Bandung, Bandung 40132, Indonesia
Sudarman: Spatial System and Cadastre Research Group, Faculty of Earth Sciences and Technology, Institut Teknologi Bandung, Bandung 40132, Indonesia

DOI: https://doi.org/10.3390/ijgi13070235
Journal volume & issue: Vol. 13, no. 7
p. 235

Abstract

Read online

Airborne laser technology produces point clouds that can be used to build 3D models of buildings. However, the work is a laborious process that could benefit from automation. Artificial intelligence (AI) has been widely used in automating building segmentation as one of the initial stages in the 3D modeling process. The algorithms with a high success rate using point clouds for automatic semantic segmentation are random forest (RF) and PointNet++, with each algorithm having its own advantages and disadvantages. However, the training and testing data to develop and test the model usually share similar characteristics. Moreover, producing a good automation model requires a lot of training data, which may become an issue for users with a small amount of training data (limited data). The aim of this research is to test the performance of the RF and PointNet++ models in different regions with limited training and testing data. We found that the RF model developed from a small amount data, in different regions between the training and testing data, performs well compared to PointNet++, yielding an OA score of 73.01% for the RF model. Furthermore, several scenarios have been used in this research to explore the capabilities of RF in several cases.

Published in ISPRS International Journal of Geo-Information

ISSN: 2220-9964 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Geography. Anthropology. Recreation: Geography (General)
Website: http://www.mdpi.com/journal/ijgi

About the journal

Abstract

Keywords