Water Science and Technology (Jun 2023)

Validation of wastewater data using artificial intelligence tools and the evaluation of their performance regarding annotator agreement

  • Imane Zidaoui,
  • Cédric Wemmert,
  • Matthieu Dufresne,
  • Claude Joannis,
  • Sandra Isel,
  • Jonathan Wertel,
  • José Vazquez

DOI
https://doi.org/10.2166/wst.2023.174
Journal volume & issue
Vol. 87, no. 12
pp. 2957 – 2970

Abstract

Read online

To prevent the pollution of water resources, the measurement and the limitation of wastewater discharges are required. Despite the progress in the field of data acquisition systems, sensors are subject to malfunctions that can bias the evaluation of the pollution flow. It is therefore essential to identify potential anomalies in the data before any use. The objective of this work is to deploy artificial intelligence tools to automate the data validation and to assess the added value of this approach in assisting the validation performed by an operator. To do so, we compare two state-of-the-art anomaly detection algorithms on turbidity data in a sewer network. On the one hand, we conclude that the One-class SVM model is not adapted to the nature of the studied data which is heterogeneous and noisy. The Matrix Profile model, on the other hand, provides promising results with a majority of anomalies detected and a relatively limited number of false positives. By comparing these results to the expert validation, it turns out that the use of the Matrix Profile model objectifies and accelerates the validation task while maintaining the same level of performance compared to the annotator agreement rate between two experts. HIGHLIGHTS The subjective nature of manual validation of wastewater data limits the agreement rate between different experts.; Artificial intelligence approaches based on binary clusterings, such as OC-SVM, are not adapted to anomaly detection in turbidity data.; Matrix profile is being evaluated on wastewater data for the first time and shows promising results.;

Keywords