Geo Data (Dec 2020)
AI Training Dataset for Cloud Detection of KOMPSAT Images
Abstract
Clouds that appear inevitably when acquiring optical satellite images hinder the interpretation of surface information, so removing them is a crucial procedure to increase the utilization of satellite images. Currently, for KOMPSAT (Korea Multi-purpose Satellite) images, only the cloud amount by visual measurement is proved for the entire scene and detailed cloud masks are not provided. Since cloud detection is a time-consuming task, we built a cloud dataset for KOMPSAT images so as to develop an algorithm that expedites the task with state-of-the-art artificial intelligent techniques. In the dataset, satellite images were selected from various regions considering that clouds have different characteristics depending on the region, and masks were classified into thin clouds, thick clouds, cloud shadows, and clear sky. The size of dataset is over 4,000 image/mask pairs by an image size of 1000x1000 and one of the largest among publicly available cloud datasets, as of this writing. The dataset is built by a government AI (artificial intelligent) training dataset building program and will be available through the website, aihub.or.kr.
Keywords