IET Image Processing (Oct 2023)

CMLocate: A cross‐modal automatic visual geo‐localization framework for a natural environment without GNSS information

  • Zhuoqun Liu,
  • Fan Guo,
  • Heng Liu,
  • Xiaoyue Xiao,
  • Jin Tang

DOI
https://doi.org/10.1049/ipr2.12883
Journal volume & issue
Vol. 17, no. 12
pp. 3524 – 3540

Abstract

Read online

Abstract In this paper, a new approach to visual geo‐localization for natural environments is proposed. The digital elevation model (DEM) data in virtual space is rendered and construct a panoramic skyline database is constructed. By combining the skyline database with real‐world image data (used as the “queries” to be localized), visual geo‐localization is treated as a cross‐modal image retrieval problem for panoramic skyline images, creating a unique new visual geo‐localization benchmark for the natural environment. Specifically, the semantic segmentation model named LineNet is proposed, for skyline extractions from query images, which has proven to be robust to a variety of complex natural environments. On the aforementioned benchmarks, the fully automatic method is elaborated for large‐scale cross‐modal localization using panoramic skyline images. Finally, the compound index is delicately designed to reduce the storage space of the positioning global descriptors and improve the retrieval efficiency. Moreover, the proposed method is proven to outperform most state‐of‐the‐art methods.

Keywords