Geo-spatial Information Science (Sep 2024)
An automatically recursive feature elimination method based on threshold decision in random forest classification
Abstract
The rich feature information contained in the diverse remote sensing data has also exhibited growing potential in the field of image classification. However, the processing of multi-feature data still grapples with the challenges posed by the “curse of dimensionality” and the high computational costs. This paper proposes a remote sensing feature parameter selection method. This method is based on the Gini index of a Random Forest (RF) classifier and employs a 10% threshold decision to identify the optimal combination of remote sensing feature parameters. First, spectral features, texture features, temperature features, elevation features, and principal component features are selected to create a stack of remote sensing images. Second, multiple sets of decision trees are established to cross-validate the contributions of various feature parameters. The feature rankings are determined based on the normalized mean of feature importance. Subsequently, a threshold isset to filter out remote sensing feature parameters that meet the criteria. Finally, an iterative parameter optimization process is carried out to obtain the optimal combination of remote sensing feature parameters. The Landsat 8 image covering Hangzhou Bay in eastern China and the Sentinel-2 remote sensing image of Yancheng Nature Reserve in Jiangsu, China were selected for experiments. The results show that the feature parameters of the remote sensing image screened by the method in this paper are representative, and the proposed method demonstrates strong adaptability and generalization performance in complex environments. The study has important guiding significance for regional spatial planning and sustainable development.
Keywords