Heliyon (Apr 2024)

Estimation of soil erodibility in Peninsular Malaysia: A case study using multiple linear regression and artificial neural networks

  • Muhammad Ali Rehman,
  • Norinah Abd Rahman,
  • Ahmad Nazrul Hakimi Ibrahim,
  • Norashikin Ahmad Kamal,
  • Asmadi Ahmad

Journal volume & issue
Vol. 10, no. 7
p. e28854

Abstract

Read online

Soil erodibility (K) is an essential component in estimating soil loss indicating the soil's susceptibility to detach and transport. Data Computing and processing methods, such as artificial neural networks (ANNs) and multiple linear regression (MLR), have proven to be helpful in the development of predictive models for natural hazards. The present case study aims to assess the efficiency of MLR and ANN models to forecast soil erodibility in Peninsular Malaysia. A total of 103 samples were collected from various sites and K values were calculated using the Tew equation developed for Malaysian soil. From several extracted parameters, the outcomes of correlation and principal component analysis (PCA) revealed the influencing factors to be used in the development of ANN and MLR models. Based on the correlation and PCA results, two sets of influencing factors were employed to develop predictive models. Two MLR (MLR-1 and MLR-2) models and four neural networks (NN-1, NN-2, NN-3, and NN-4) optimized using Levenberg-Marquardt (LM) and scaled conjugate gradient (SCG) were developed and evaluated. The model performance validation was conducted using the coefficient of determination (R2), mean squared error (MSE), root mean squared error (RMSE), and Nash-Sutcliffe efficiency coefficient (NSE). The analysis showed that ANN models outperformed MLR models. The R2 values of 0.446 (MLR-1), 0.430 (MLR-2), 0.894 (NN-1), 0.855 (NN-2), 0.940 (NN-3), and 0.826 (NN-4); MSE values of 0.0000306 (MLR-1), 0.0000315 (MLR-2), 0.0000158 (NN-1), 0.0000261 (NN-2), 0.0000318 (NN-3), and 0.0000216 (NN-4) suggested the higher accuracy and lower modelling error of ANN models as compared with MLR. This study could provide an empirical basis and methodological support for K factor estimation in the region.

Keywords