Alexandria Engineering Journal (Jan 2024)
Incorporating novel input variable selection method for in the different water basins of Thailand
Abstract
Selecting appropriate input variables for developing a rainfall prediction model is significantly difficult. The present study proposed an innovative framework for input variable selection (IVS) model bootstrapped long short-term recurrent neural network (BTSP-LSTM-RNN) to identify relevant variables for monthly rainfall forecasting. Monthly meteorological and large-scale climatic variables (LCVs) from 1993 to 2022 at two selected river basins in the northern region of Thailand were used for model development. The proposed BTSP-LSTM-RNN model results were compared with the support vector regression with recursive feature elimination (SVR-RFE) and Gradient boosting (GB) by statistical metrics such as coefficient of determination (R2), mean absolute error (MAE), relative root mean squared error (RRMSE), Pearson’s correlation coefficient (r) and mean absolute percentage error (MAPE). BTSP-LSTM-RNN demonstrated exceptional performance, boasting a higher R2 (0.84), MAE (92.28), RRMSE (10.36) in the Wang basin, and R2 (0.83), MAE (242.60), RRMSE (9.93) in the Nan basin. BTSP-LSTM-RNN also achieved the lowest MAPE of 29.82%. Based on this IVS model results, two input variable combinations (IVCs) were designed. IVC-1 is based on BTSP-LSTM-RNN selection, and IVC-2 is an original set of variables. LSTM-RNN, multi-layer perceptron artificial neural network (MLP-ANN), and ensemble model with bootstrapping on random forest (RF) were employed for monthly prediction. When BTSP-LSTM-RNN's selected input variables from IVC-1 are utilized, the BSTP-RF model demonstrates robust performance. It achieves a high R2 (0.82), a low RRMSE (10.13%) suggests accurate predictions, and the r of 0.91 further supports the model's strong linear relationship with observed rainfall data. Based on prediction model results, the BTSP-LSTM-RNN (IVS) model plays a pivotal role in the selection of input variables for rainfall forecasting and its impact on the performance of prediction models (BSTP-RF, MLP-ANN, and LSTM-RNN). These results consistently underscored the pivotal role of the BTSP-LSTM-RNN IVS model in enhancing the precision and reliability of rainfall predictions.