Hydrology (Feb 2022)
An Efficient Data Driven-Based Model for Prediction of the Total Sediment Load in Rivers
Abstract
Sediment load in fluvial systems is one of the critical factors shaping the river geomorphological and hydraulic characteristics. A detailed understanding of the total sediment load (TSL) is required for the protection of physical, environmental, and ecological functions of rivers. This study develops a robust methodological approach based on multiple linear regression (MLR) and support vector regression (SVR) models modified by principal component analysis (PCA) to predict the TSL in rivers. A database of sediment measurement from large-scale physical modelling tests with 4759 datapoints were used to develop the predictive model. A dimensional analysis was performed based on the literature, and ten dimensionless parameters were identified as the key drivers of the TSL in rivers. These drivers were converted to uncorrelated principal components to feed the MLR and SVR models (PCA-based MLR and PCA-based SVR models) developed within this study. A stepwise PCA-based MLR and a 10-fold PCA-based SVR model with different kernel-type functions were tuned to derive an accurate TSL predictive model. Our findings suggest that the PCA-based SVR model with the kernel-type radial basis function has the best predictive performance in terms of statistical error measures including the root-mean-square error normalized with the standard deviation (RMSE/StD) and the Nash–Sutcliffe coefficient of efficiency (NSE), for the estimation of the TSL in rivers. The PCA-based MLR and PCA-based SVR models, with an overall RMSE/StD of 0.45 and 0.35, respectively, outperform the existing well-established empirical formulae for TSL estimation. The analysis of the results confirms the robustness of the proposed PCA-based SVR model for prediction of the cases with high concentration of sediments (NSE = 0.68), where the existing sediment estimation models usually have poor performance.
Keywords