Remote Sensing (Mar 2024)

A Single Data Extraction Algorithm for Oblique Photographic Data Based on the U-Net

  • Shaohua Wang,
  • Xiao Li,
  • Liming Lin,
  • Hao Lu,
  • Ying Jiang,
  • Ning Zhang,
  • Wenda Wang,
  • Jianwei Yue,
  • Ziqiong Li

DOI
https://doi.org/10.3390/rs16060979
Journal volume & issue
Vol. 16, no. 6
p. 979

Abstract

Read online

In the automated modeling generated by oblique photography, various terrains cannot be physically distinguished individually within the triangulated irregular network (TIN). To utilize the data representing individual features, such as a single building, a process of building monomer construction is required to identify and extract these distinct parts. This approach aids subsequent analyses by focusing on specific entities, mitigating interference from complex scenes. A deep convolutional neural network is constructed, combining U-Net and ResNeXt architectures. The network takes as input both digital orthophoto map (DOM) and oblique photography data, effectively extracting the polygonal footprints of buildings. Extraction accuracy among different algorithms is compared, with results indicating that the ResNeXt-based network achieves the highest intersection over union (IOU) for building segmentation, reaching 0.8255. The proposed “dynamic virtual monomer” technique binds the extracted vector footprints dynamically to the original oblique photography surface through rendering. This enables the selective representation and querying of individual buildings. Empirical evidence demonstrates the effectiveness of this technique in interactive queries and spatial analysis. The high level of automation and excellent accuracy of this method can further advance the application of oblique photography data in 3D urban modeling and geographic information system (GIS) analysis.

Keywords