Applying Recurrent Neural Networks and Blocked Cross-Validation to Model Conventional Drinking Water Treatment Processes

Aleksandar Jakovljevic; Laurent Charlin; Benoit Barbeau

doi:10.3390/w16071042

Water (Apr 2024)

Applying Recurrent Neural Networks and Blocked Cross-Validation to Model Conventional Drinking Water Treatment Processes

Aleksandar Jakovljevic,
Laurent Charlin,
Benoit Barbeau

Affiliations

Aleksandar Jakovljevic: Department of Civil, Geological and Mining Engineering, Polytechnique Montréal, 2500 Chemin de Polytechnique, Montréal, QC H3T 1J4, Canada
Laurent Charlin: Department of Decision Sciences, HEC Montréal, 3000 Chemin de la Côte-Sainte-Catherine, Montréal, QC H3T 2A7, Canada
Benoit Barbeau: Department of Civil, Geological and Mining Engineering, Polytechnique Montréal, 2500 Chemin de Polytechnique, Montréal, QC H3T 1J4, Canada

DOI: https://doi.org/10.3390/w16071042
Journal volume & issue: Vol. 16, no. 7
p. 1042

Abstract

Read online

The jar test is the current standard method for predicting the performance of a conventional drinking water treatment (DWT) process and optimizing the coagulant dose. This test is time-consuming and requires human intervention, meaning it is infeasible for making continuous process predictions. As a potential alternative, we developed a machine learning (ML) model from historical DWT plant data that can operate continuously using real-time sensor data without human intervention for predicting clarified water turbidity 15 min in advance. We evaluated three types of models: multilayer perceptron (MLP), the long short-term memory (LSTM) recurrent neural network (RNN), and the gated recurrent unit (GRU) RNN. We also employed two training methodologies: the commonly used holdout method and the theoretically correct blocked cross-validation (BCV) method. We found that the RNN with GRU was the best model type overall and achieved a mean absolute error on an independent production set of as low as 0.044 NTU. We further found that models trained using BCV typically achieve errors equal to or lower than their counterparts trained using holdout. These results suggest that RNNs trained using BCV are superior for the development of ML models for DWT processes compared to those reported in earlier literature.

Published in Water

ISSN: 2073-4441 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Hydraulic engineering; Technology: Environmental technology. Sanitary engineering: Water supply for domestic and industrial purposes
Website: http://www.mdpi.com/journal/water/

About the journal

Abstract

Keywords