Journal of Hydroinformatics (May 2023)
Short-term water demand forecasting using data-centric machine learning approaches
Abstract
Accurate water demand forecasting is the key to urban water management and can alleviate system pressure brought by urbanisation, water scarcity and climate change. However, existing research on water demand forecasting using machine learning is focused on model-centric approaches, where various forecasting models are tested to improve accuracy. The study undertakes a data-centric machine learning approach by analysing the impact of training data length, temporal resolution and data uncertainty on forecasting model results. The models evaluated are Autoregressive (AR) Integrated Moving Average (ARIMA), Neural Network (NN), Random Forest (RF) and Prophet. The first two are commonly used forecasting models. RF has shown similar forecast accuracy to NN but has received less attention. Prophet is a new model that has not been applied to short-term water demand forecasting, though it has had successful applications in various fields. The results obtained from four case studies show that (1) data-centric machine learning approaches offer promise for improving forecast accuracy of short-term water demands; (2) accurate forecasts are possible with short training data; (3) RF and NN models are superior at forecasting high-temporal resolution data; and (4) data quality improvement can achieve a level of accuracy increase comparable to model-centric machine learning approaches. HIGHLIGHTS Data-centric machine learning approaches offer promise for improving the forecast accuracy of short-term water demands.; Accurate forecasts are possible with short training data.; Random forest and neural network models are superior at forecasting high-temporal resolution data.; Data quality improvements can achieve a level of accuracy increase comparable to model-centric machine learning approaches.;
Keywords