IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (Jan 2023)
Learning Dense Consistent Features for Aerial-to-Ground Structure-From-Motion
Abstract
The integration of aerial and ground images is known to be effective for enhancing the quality of 3-D reconstruction in complex urban scenarios. However, directly applying structure-from-motion (SfM) to aerial and ground images for unified 3-D reconstruction is particularly difficult because of the large differences in viewpoint, scale, and appearance between the two types of images. Previous studies mainly rely on viewpoint rectification or view rendering/synthesis to improve feature matching quality when aligning the aerial and ground models; nevertheless, these approaches still fail to address the inherent information differences between aerial and ground images. In this article, we propose a learning-based matching framework for direct SfM with aerial and ground images. The key idea of our method is to learn pixel-wise consistent features between aerial and ground images to handle the large heterogeneity of the two image types, enabling robust correspondence between them. Given these high-quality matches, the learned feature maps are further used to refine keypoint locations and to fuse a featuremetric error into bundle adjustment alongside the geometric error, both of which improve the accuracy and completeness of the recovered 3-D scene. Extensive experiments conducted on six datasets demonstrate that the proposed method reconstructs high-fidelity 3-D models with direct aerial-to-ground SfM, which existing methods cannot achieve. In addition, our method shows outstanding performance on the subtasks of feature matching and point cloud recovery.
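To make the idea of fusing featuremetric error into bundle adjustment concrete, the following is a minimal toy sketch, not the authors' implementation: for one observation, it sums the geometric reprojection error with a featuremetric term obtained by bilinearly sampling a dense feature map at the projected location. The function names (`project`, `bilinear_sample`, `joint_residual`) and the weight `lam` are illustrative assumptions, not names from the paper.

```python
# Hypothetical sketch of a combined geometric + featuremetric BA residual.
# All names and the weighting scheme are illustrative assumptions.
import numpy as np

def project(point3d, K, R, t):
    """Pinhole projection of a 3-D point into pixel coordinates (u, v)."""
    p_cam = R @ point3d + t          # world -> camera frame
    p_img = K @ p_cam                # camera frame -> homogeneous pixels
    return p_img[:2] / p_img[2]

def bilinear_sample(fmap, uv):
    """Bilinearly sample an (H, W, C) feature map at subpixel location uv."""
    x, y = uv
    x0, y0 = int(np.floor(x)), int(np.floor(y))
    dx, dy = x - x0, y - y0
    return (fmap[y0, x0] * (1 - dx) * (1 - dy)
            + fmap[y0, x0 + 1] * dx * (1 - dy)
            + fmap[y0 + 1, x0] * (1 - dx) * dy
            + fmap[y0 + 1, x0 + 1] * dx * dy)

def joint_residual(point3d, keypoint, ref_feat, fmap, K, R, t, lam=0.5):
    """Geometric reprojection error plus a weighted featuremetric error.

    keypoint: the detected 2-D location; ref_feat: the reference feature
    descriptor; lam: a hypothetical balance weight between the two terms.
    """
    uv = project(point3d, K, R, t)
    geometric = np.linalg.norm(uv - keypoint)
    featuremetric = np.linalg.norm(bilinear_sample(fmap, uv) - ref_feat)
    return geometric + lam * featuremetric
```

In a full pipeline, a nonlinear least-squares solver would minimize the sum of such residuals over all observations with respect to the camera poses and 3-D points; this sketch only shows how the two error terms could be combined per observation.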
Keywords