Journal of Hydroinformatics (May 2022)

Comparison of EEMD-ARIMA, EEMD-BP and EEMD-SVM algorithms for predicting the hourly urban water consumption

  • Xingpo Liu,
  • Yiqing Zhang,
  • Qichen Zhang

DOI
https://doi.org/10.2166/hydro.2022.146
Journal volume & issue
Vol. 24, no. 3
pp. 535 – 558

Abstract

Read online

Short-term (e.g., hourly) urban water consumption (or demand) prediction is of great significance for the optimal operation of the intelligent water distribution pump stations. In this study, three single models (autoregressive integrated moving average (ARIMA), back-propagation (BP), support vector machine (SVM)) and three hybrid models (ensemble empirical mode decomposition (EEMD)-ARIMA, EEMD-BP and EEMD-SVM) were developed and compared in terms of prediction accuracy and application convenience. 31-day (1 month) hourly flow series from a water distribution division in Shanghai were used for the demonstration case study, among which 30-day data used for model training and 1-day data used for model verification. Finally, the effects of historical data length on the prediction accuracy of three hybrid models were also analyzed, and the optima of the historical data length for three hybrid models were obtained. Results reveal that (1) the mean absolute percentage errors (MAPE) of EEMD-ARIMA, EEMD-BP, EEMD-SVM, ARIMA, BP and SVM are 5.2036, 1.4460, 1.3424, 5.7891, 4.3857 and 3.8470%, respectively. (2) In terms of prediction accuracy and actual practice convenience, EEMD-SVM performs best among the above six models. (3) The EEMD algorithm is effective for improving the prediction accuracy of six models. (4) The optimal historical data length of EEMD-ARIMA, EEMD-BP and EEMD-SVM are 11, 11 and 10 days, respectively. HIGHLIGHTS Three single models (ARIMA, BP and SVM) and three hybrid models (EEMD-ARIMA, EEMD-BP and EEMD-SVM) were compared for the prediction of hourly water demand.; EEMD-SVM performs best among the six prediction models.; The EEMD algorithm is significant for improving prediction accuracy.; The optimal historical data length for intelligent algorithms should be greater than a week.;

Keywords