Remote Sensing (Jul 2024)
Winter Wheat Mapping Method Based on Pseudo-Labels and U-Net Model for Training Sample Shortage
Abstract
In recent years, the semantic segmentation model has been widely applied in fields such as the extraction of crops due to its advantages such as strong discrimination ability, high accuracy, etc. Currently, there is no standard set of ground true label data for major crops in China, and the visual interpretation process is usually time-consuming and laborious. The sample size also makes it difficult to support the model to learn enough ground features, resulting in poor generalisation ability of the model, which in turn makes the model difficult to apply in fine extraction tasks of large-area crops. In this study, a method to establish a pseudo-label sample set based on the random forest algorithm to train a semantic segmentation model (U-Net) was proposed to perform winter wheat extraction. With the help of the GEE platform, Winter Wheat Canopy Index (WCI) indicators were employed in this method to initially extract winter wheat, and training samples (i.e., pseudo labels) were built for the semantic segmentation model through the iterative process of “generating random sample points—random forest model training—winter wheat extraction”; on this basis, the U-net model was trained with multi-time series remote sensing images; finally, the U-Net model was employed to obtain the spatial distribution map of winter wheat in Henan Province in 2022. The results illustrated that: (1) Pseudo-label data were constructed using the random forest model in typical regions, achieving an overall accuracy of 97.53% under validation with manual samples, proving that its accuracy meets the requirements for U-Net model training. (2) Utilizing the U-Net model, U-Net++ model, and random forest model constructed based on pseudo-label data for 2022, winter wheat mapping was conducted in Henan Province. The extraction accuracy of the three models is in the order of U-Net model > U-Net++ model > random forest model. (3) Using the U-Net model to predict the winter wheat planting areas in Henan Province in 2019, although the extraction accuracy decreased compared to 2022, it still exceeded that of the random forest model. Additionally, the U-Net++ model did not achieve higher classification accuracy. (4) Experimental results demonstrate that deep learning models constructed based on pseudo-labels exhibit higher classification accuracy. Compared to traditional machine learning models like random forest, they have higher spatiotemporal adaptability and robustness, further validating the scientific and practical feasibility of pseudo-labels and their generation strategies, which are expected to provide a feasible technical pathway for intelligent extraction of winter wheat spatial distribution information in the future.
Keywords