CartoMark: a benchmark dataset for map pattern recognition and map content retrieval with machine intelligence

Xiran Zhou; Yi Wen; Zhenfeng Shao; Wenwen Li; Kaiyuan Li; Honghao Li; Xiao Xie; Zhigang Yan

doi:10.1038/s41597-024-04057-7

Scientific Data (Nov 2024)

CartoMark: a benchmark dataset for map pattern recognition and map content retrieval with machine intelligence

Xiran Zhou,
Yi Wen,
Zhenfeng Shao,
Wenwen Li,
Kaiyuan Li,
Honghao Li,
Xiao Xie,
Zhigang Yan

Affiliations

Xiran Zhou: Key Laboratory of Surveying and Mapping Science and Geospatial Information Technology of MNR, Chinese Academy of Surveying and Mapping
Yi Wen: School of Environment and Spatial Informatics, China University of Mining and Technology
Zhenfeng Shao: State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University
Wenwen Li: School of Geographical Sciences and Urban Planning, Arizona State University
Kaiyuan Li: School of Environment and Spatial Informatics, China University of Mining and Technology
Honghao Li: School of Environment and Spatial Informatics, China University of Mining and Technology
Xiao Xie: Key Lab for Environmental Computation and Sustainability of Liaoning Province
Zhigang Yan: School of Environment and Spatial Informatics, China University of Mining and Technology

DOI: https://doi.org/10.1038/s41597-024-04057-7
Journal volume & issue: Vol. 11, no. 1
pp. 1 – 13

Abstract

Read online

Abstract Maps are fundamental medium to visualize and represent the real word in a simple and philosophical way. The emergence of the big data tide has made a proportion of maps generated from multiple sources, significantly enriching the dimensions and perspectives for understanding the characteristics of the real world. However, a majority of these map datasets remain undiscovered, unacquired and ineffectively used, which arises from the lack of numerous well-labelled benchmark datasets, which are of significance to implement the deep learning techniques into identifying complicated map content. To address this issue, we develop a large-scale benchmark dataset involving well-labelled datasets to employ the state-of-the-art machine intelligence technologies for map text annotation recognition, map scene classification, map super-resolution reconstruction, and map style transferring. Furthermore, these well-labelled datasets would facilitate map feature detection, map pattern recognition and map content retrieval. We hope our efforts would provide well-labelled data resources for advancing the ability to recognize and discover valuable map content.

Published in Scientific Data

ISSN: 2052-4463 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Science
Website: https://www.nature.com/sdata/

About the journal