Ortho-NeRF: generating a true digital orthophoto map using the neural radiance field from unmanned aerial vehicle images

Shihan Chen; Qingsong Yan; Yingjie Qu; Wang Gao; Junxing Yang; Fei Deng

doi:10.1080/10095020.2023.2296014

Geo-spatial Information Science (Mar 2024)

Ortho-NeRF: generating a true digital orthophoto map using the neural radiance field from unmanned aerial vehicle images

Shihan Chen,
Qingsong Yan,
Yingjie Qu,
Wang Gao,
Junxing Yang,
Fei Deng

Affiliations

Shihan Chen: School of Geodesy and Geomatics, Wuhan University, Wuhan, China
Qingsong Yan: School of Geodesy and Geomatics, Wuhan University, Wuhan, China
Yingjie Qu: School of Geodesy and Geomatics, Wuhan University, Wuhan, China
Wang Gao: Science and Technology on Complex System Control and Intelligent Agent Cooperation Laboratory, Beijing, China
Junxing Yang: School of Geomatics and Urban Spatial Informatics, Beijing University of Civil Engineering and Architecture, Beijing, China
Fei Deng: School of Geodesy and Geomatics, Wuhan University, Wuhan, China

DOI: https://doi.org/10.1080/10095020.2023.2296014

Abstract

Read online

True Digital Orthophoto Maps (TDOMs) have high geometric accuracy and rich image characteristics, making them essential geographic data for national economic and social development. Complex terrain and artificial structures, automatic distortion elimination and occluded area recovery in TDOM generation pose significant challenges. Hence, the need for further improvements in both mapping accuracy and automation is highlighted. In this paper, we present an approach for generating a TDOM based on a Neural Radiance Field (NeRF) without utilizing prior three-dimensional geometry information called an Ortho Neural Radiance Field (Ortho-NeRF). The Ortho-NeRF divides a large-scale scene into small tiles, implicitly reconstructing each tile by selecting pixels on posed images, and individually generate TDOMs of all tiles using a true-ortho-volume rendering before mosaicking. Additionally, the Ortho-NeRF uses a strategy to skip empty spaces and adaptively set the spatial resolution of a voxel grid, improving the generated TDOM quality with fewer computational resources. Many experiments showed that our approach outperforms ContextCapture, Metashape, Pix4DMapper, and Map2DFusion, especially in challenging areas. Owing to its global consistency and continuous nature, Ortho-NeRF was able to effectively reconstruct the geometry information and details, generating TDOMs without distortion or misalignment. Eight ground control points were randomly selected to evaluate the geometric accuracy of the TDOMs, with an average median error of 0.267 m. The length between two points on a plane was also measured for quantitative evaluation, with a mean absolute error of 0.08 m and a mean relative error of 0.14%. Compared with the NeRF efficiency, that of the Ortho-NeRF increased 104 times in training and about 1000 times in rendering.

Published in Geo-spatial Information Science

ISSN: 1009-5020 (Print); 1993-5153 (Online)
Publisher: Taylor & Francis Group
Country of publisher: United Kingdom
LCC subjects: Geography. Anthropology. Recreation: Mathematical geography. Cartography; Science: Astronomy: Geodesy
Website: https://www.tandfonline.com/journals/tgsi

About the journal

Abstract

Keywords