Water (Jun 2023)

Optimizing Rotation Forest-Based Decision Tree Algorithms for Groundwater Potential Mapping

  • Wei Chen,
  • Zhao Wang,
  • Guirong Wang,
  • Zixin Ning,
  • Boxiang Lian,
  • Shangjie Li,
  • Paraskevas Tsangaratos,
  • Ioanna Ilia,
  • Weifeng Xue

DOI
https://doi.org/10.3390/w15122287
Journal volume & issue
Vol. 15, no. 12
p. 2287

Abstract

Read online

Groundwater potential mapping is an important prerequisite for evaluating the exploitation, utilization, and recharge of groundwater. The study uses BFT (best-first decision tree classifier), CART (classification and regression tree), FT (functional trees), EBF (evidential belief function) benchmark models, and RF-BFTree, RF-CART, and RF-FT ensemble models to map the groundwater potential of Wuqi County, China. Firstly, select sixteen groundwater spring-related variables, such as altitude, plan curvature, profile curvature, curvature, slope angle, slope aspect, stream power index, topographic wetness index, stream sediment transport index, normalized difference vegetation index, land use, soil, lithology, distance to roads, distance to rivers, and rainfall, and make a correlation analysis of these sixteen groundwater spring-related variables. Secondly, optimize the parameters of the seven models and select the optimal parameters for groundwater modeling in Wuqi County. The predictive performance of each model was evaluated by estimating the area under the receiver operating characteristic (ROC) curve (AUC) and statistical index (accuracy, sensitivity, and specificity). The results show that the seven models have good predictive capabilities, and the ensemble model has a larger AUC value. Among them, the RF-BFT model has the highest success rate (AUC = 0.911), followed by RF-FT (0.898), RF-CART (0.894), FT (0.852), EBF (0.824), CART (0.801), and BFtree (0.784), respectively. Groundwater potential maps of these 7 models were obtained, and four different classification methods (geometric interval, natural breaks, quantile, and equal interval) were used to reclassify the obtained GPM into 5 categories: very low (VLC), low (LC), moderate (MC), high (HC), and very high (VHC). The results show that the natural breaks method has the best classification performance, and the RF-BFT model is the most reliable. The study highlights that the proposed ensemble model has more efficient and accurate performance for groundwater potential mapping.

Keywords