Engineering (Sep 2023)

Runoff Modeling in Ungauged Catchments Using Machine Learning Algorithm-Based Model Parameters Regionalization Methodology

  • Houfa Wu,
  • Jianyun Zhang,
  • Zhenxin Bao,
  • Guoqing Wang,
  • Wensheng Wang,
  • Yanqing Yang,
  • Jie Wang

Journal volume & issue
Vol. 28
pp. 93 – 104

Abstract

Read online

Model parameters estimation is a pivotal issue for runoff modeling in ungauged catchments. The nonlinear relationship between model parameters and catchment descriptors is a major obstacle for parameter regionalization, which is the most widely used approach. Runoff modeling was studied in 38 catchments located in the Yellow–Huai–Hai River Basin (YHHRB). The values of the Nash–Sutcliffe efficiency coefficient (NSE), coefficient of determination (R2), and percent bias (PBIAS) indicated the acceptable performance of the soil and water assessment tool (SWAT) model in the YHHRB. Nine descriptors belonging to the categories of climate, soil, vegetation, and topography were used to express the catchment characteristics related to the hydrological processes. The quantitative relationships between the parameters of the SWAT model and the catchment descriptors were analyzed by six regression-based models, including linear regression (LR) equations, support vector regression (SVR), random forest (RF), k-nearest neighbor (kNN), decision tree (DT), and radial basis function (RBF). Each of the 38 catchments was assumed to be an ungauged catchment in turn. Then, the parameters in each target catchment were estimated by the constructed regression models based on the remaining 37 donor catchments. Furthermore, the similarity-based regionalization scheme was used for comparison with the regression-based approach. The results indicated that the runoff with the highest accuracy was modeled by the SVR-based scheme in ungauged catchments. Compared with the traditional LR-based approach, the accuracy of the runoff modeling in ungauged catchments was improved by the machine learning algorithms because of the outstanding capability to deal with nonlinear relationships. The performances of different approaches were similar in humid regions, while the advantages of the machine learning techniques were more evident in arid regions. When the study area contained nested catchments, the best result was calculated with the similarity-based parameter regionalization scheme because of the high catchment density and short spatial distance. The new findings could improve flood forecasting and water resources planning in regions that lack observed data.

Keywords