Geo-spatial Information Science (Sep 2024)
A large scale training sample database system for intelligent interpretation of remote sensing imagery
Abstract
Artificial Intelligence (AI) Machine Learning (ML) technologies, particularly Deep Learning (DL), have demonstrated significant potential in the interpretation of Remote Sensing (RS) imagery, covering tasks such as scene classification, object detection, land-cover/land-use classification, change detection, and multi-view stereo reconstruction. Large-scale training samples are essential for ML/DL models to achieve optimal performance. However, the current organization of training samples is ad-hoc and vendor-specific, lacking an integrated approach that can effectively manage training samples from different vendors to meet the demands of various RS AI tasks. This article proposes a solution to address these challenges by designing and implementing LuoJiaSET, a large-scale training sample database system for intelligent interpretation of RS imagery. LuoJiaSET accommodates over five million training samples, providing support for cross-dataset queries and serving as a comprehensive training data store for RS AI model training and calibration. It overcomes challenges related to label semantic categories, structural heterogeneity in label representation, and interoperable data access.
Keywords