A new method for reconstructing building model using machine learning

Shengjie Wu; Haibo Ye; Antao Li; Huawei Tu; Shenxin Xu; Dong Liang

doi:10.26599/jic.2025.9180041

Journal of Intelligent Construction (Jan 2025)

A new method for reconstructing building model using machine learning

Shengjie Wu,
Haibo Ye,
Antao Li,
Huawei Tu,
Shenxin Xu,
Dong Liang

Affiliations

Shengjie Wu: College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China
Haibo Ye: College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China
Antao Li: Shanghai Institute of Satellite Engineering, Shanghai 201109, China
Huawei Tu: Department of Computer Science and Information Technology, La Trobe University, Melbourne 3086, Australia
Shenxin Xu: Shanghai Institute of Satellite Engineering, Shanghai 201109, China
Dong Liang: College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China

DOI: https://doi.org/10.26599/jic.2025.9180041
Journal volume & issue: Vol. 3, no. 1
p. 9180041

Abstract

Read online

Three-dimensional (3D) model reconstruction is used in an increasing number of fields related to construction, such as urban planning, mobile communication planning, and solar power assessment. Existing 3D reconstruction models mostly focus on precise measurements, such as laser scanning and ultrasonic mapping. Although these methods can achieve very precise results, they require specific equipment, which is typically expensive. The essence of 3D reconstruction is to infer the overall view of a building through pictures from previously taken perspectives, thereby obtaining pictures from unfamiliar perspectives. In this study, the rendering method is adopted as the starting point, and architectural features are learned by training a neural network to provide the necessary information for rendering. Unlike the more popular projection-based raster rendering method, this study uses a point-based volume rendering method and light sampling to detect architectural features. This rendering method requires the color and density of specific sampling points. Therefore, this study attempts to train a neural network to fit a five-dimensional (5D) function. The input to this function is a 5D vector, including the position (x, y, z) and viewing direction (θ, φ), and the output is the color and density of this point when viewed from this direction. This study adopts the positional encoding method, which reduces the scale of the network and increases both the training and rendering speeds. Our method can train a usable network in dozens of seconds and render a building at 30–60 frames per second.

Published in Journal of Intelligent Construction

ISSN: 2958-3861 (Print); 2958-2652 (Online)
Publisher: Tsinghua University Press
Country of publisher: China
LCC subjects: Technology: Hydraulic engineering; Technology: Engineering (General). Civil engineering (General): Structural engineering (General)
Website: https://www.sciopen.com/journal/2958-3861

About the journal

Abstract

Keywords