Investigation of cross-entropy-based streamflow forecasting through an efficient interpretable automated search process

K. L. Chong; Y. F. Huang; C. H. Koo; Mohsen Sherif; Ali Najah Ahmed; Ahmed El-Shafie

doi:10.1007/s13201-022-01790-5

Applied Water Science (Nov 2022)

Investigation of cross-entropy-based streamflow forecasting through an efficient interpretable automated search process

K. L. Chong,
Y. F. Huang,
C. H. Koo,
Mohsen Sherif,
Ali Najah Ahmed,
Ahmed El-Shafie

Affiliations

K. L. Chong: Department of Civil Engineering, Lee Kong Chian Faculty of Engineering and Science, Universiti Tunku Abdul Rahman
Y. F. Huang: Department of Civil Engineering, Lee Kong Chian Faculty of Engineering and Science, Universiti Tunku Abdul Rahman
C. H. Koo: Department of Civil Engineering, Lee Kong Chian Faculty of Engineering and Science, Universiti Tunku Abdul Rahman
Mohsen Sherif: Civil and Environmental Engineering Department, College of Engineering, United Arab Emirates University
Ali Najah Ahmed: Institute of Energy Infrastructure (IEI), Department of Civil Engineering, College of Engineering, Universiti Tenaga Nasional (UNITEN)
Ahmed El-Shafie: Department of Civil Engineering, Faculty of Engineering, University of Malaya

DOI: https://doi.org/10.1007/s13201-022-01790-5
Journal volume & issue: Vol. 13, no. 1
pp. 1 – 32

Abstract

Read online

Abstract Streamflow forecasting has always been important in water resources management, particularly the peak flow, which often determines the seriousness of the impending flood. However, the highly imbalanced flow distribution often hinders the machine learning algorithm's performance. In this paper, streamflow forecasting was approached through the formulation of two distinct machine learning problems: categorical streamflow forecast and regression streamflow forecast. Due to the distinctive characteristics of these two adopted forms, selecting the correct algorithm for the machine learning problem along with their hyperparameter tuning process is critical to the realization of the desired results. For the distinct streamflow formulated scenarios, three neural network algorithms and their hyperparameter tuning strategy were investigated. The comparative empirical studies had revealed that formulated categorical-based streamflow forecast is a better choice than a regression-based streamflow forecast, regardless of the algorithms used; for instance, the f1-score of 0.7 (categorical based) is obtained compared to the 0.53 (regression based) for the LSTM in scenario 1 (binary). Furthermore, forest-based algorithms were investigated and shown to be superior at forecasting high streamflow fluctuations in situations featuring low-dimensional streamflow input. Besides, encoding the streamflow time series as images (input) for forecasting purposes would require a thorough analysis as there is a discrepancy in the results, revealing that not all approaches are suitable for streamflow image transformation. The functional ANOVA analysis provided evidence to substantiate the Bayesian optimization results, implying that the hyperparameters were effectively optimized.

Published in Applied Water Science

ISSN: 2190-5487 (Print); 2190-5495 (Online)
Publisher: SpringerOpen
Country of publisher: Germany
LCC subjects: Technology: Environmental technology. Sanitary engineering: Water supply for domestic and industrial purposes
Website: http://www.springer.com/13201

About the journal

Abstract

Keywords