A Single Data Extraction Algorithm for Oblique Photographic Data Based on the U-Net

Shaohua Wang; Xiao Li; Liming Lin; Hao Lu; Ying Jiang; Ning Zhang; Wenda Wang; Jianwei Yue; Ziqiong Li

doi:10.3390/rs16060979

Remote Sensing (Mar 2024)

A Single Data Extraction Algorithm for Oblique Photographic Data Based on the U-Net

Shaohua Wang,
Xiao Li,
Liming Lin,
Hao Lu,
Ying Jiang,
Ning Zhang,
Wenda Wang,
Jianwei Yue,
Ziqiong Li

Affiliations

Shaohua Wang: Faculty of Geomatics, Lanzhou Jiaotong University, Lanzhou 730070, China
Xiao Li: Faculty of Geomatics, Lanzhou Jiaotong University, Lanzhou 730070, China
Liming Lin: STATE GRID Location-Based Service Co., Ltd., Beijing 100015, China
Hao Lu: SuperMap Software Co., Ltd., Beijing 100015, China
Ying Jiang: STATE GRID Location-Based Service Co., Ltd., Beijing 100015, China
Ning Zhang: Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China
Wenda Wang: Faculty of Geomatics, Lanzhou Jiaotong University, Lanzhou 730070, China
Jianwei Yue: Faculty of Geographical Science, Beijing Normal University, Beijing 100875, China
Ziqiong Li: The Bartlett Centre for Advanced Spatial Analysis, University College London, London W1T 4TJ, UK

DOI: https://doi.org/10.3390/rs16060979
Journal volume & issue: Vol. 16, no. 6
p. 979

Abstract

Read online

In the automated modeling generated by oblique photography, various terrains cannot be physically distinguished individually within the triangulated irregular network (TIN). To utilize the data representing individual features, such as a single building, a process of building monomer construction is required to identify and extract these distinct parts. This approach aids subsequent analyses by focusing on specific entities, mitigating interference from complex scenes. A deep convolutional neural network is constructed, combining U-Net and ResNeXt architectures. The network takes as input both digital orthophoto map (DOM) and oblique photography data, effectively extracting the polygonal footprints of buildings. Extraction accuracy among different algorithms is compared, with results indicating that the ResNeXt-based network achieves the highest intersection over union (IOU) for building segmentation, reaching 0.8255. The proposed “dynamic virtual monomer” technique binds the extracted vector footprints dynamically to the original oblique photography surface through rendering. This enables the selective representation and querying of individual buildings. Empirical evidence demonstrates the effectiveness of this technique in interactive queries and spatial analysis. The high level of automation and excellent accuracy of this method can further advance the application of oblique photography data in 3D urban modeling and geographic information system (GIS) analysis.

Published in Remote Sensing

ISSN: 2072-4292 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science
Website: http://www.mdpi.com/journal/remotesensing/

About the journal

Abstract

Keywords