Water Science and Technology (Feb 2022)

Water quality prediction based on Naïve Bayes algorithm

  • M. Ilić,
  • Z. Srdjević,
  • B. Srdjević

DOI
https://doi.org/10.2166/wst.2022.006
Journal volume & issue
Vol. 85, no. 4
pp. 1027 – 1039

Abstract

Read online

In the fast-changing world with increased water demand, water pollution, environmental problems, and related data, information on water quality and suitability for any purpose should be prompt and reliable. Traditional approaches often fail in the attempt to predict water quality classes and new ones are needed to handle a large amount or missing data to predict water quality in real time. One of such approaches is machine-learning (ML) based prediction. This paper presents the results of the application of the Naïve Bayes, a widely used ML method, in creating the prediction model. The proposed model is based on nine water quality parameters: temperature, pH value, electrical conductivity, oxygen saturation, biological oxygen demand, suspended solids, nitrogen oxides, orthophosphates, and ammonium. It is created in Netica software and tested and verified using data covering the period 2013–2019 from five locations in Vojvodina Province, Serbia. Forty-eight samples were used to train the model. Once trained, the Naïve Bayes model correctly predicted the class of water sample in 64 out of 68 cases, including cases with missing data. This recommends it as a trustful tool in the transition from traditional to digital water management. HIGHLIGHTS The study tests efficiency of water quality prediction by the Naïve Bayes method.; Nine water quality parameters are analyzed, defined by SWQI methodology.; Water quality is assessed by the Naïve Bayes model at five locations in Serbia and 68 samples (cases) of data.; Prediction model is trained on 48 cases.; The model predicted the water quality class accurately in 64 out of 68 cases.;

Keywords