A machine learning approach to site groundwater contamination monitoring wells

V. Gómez-Escalonilla; E. Montero-González; S. Díaz-Alcaide; M. Martín-Loeches; M. Rodríguez del Rosario; P. Martínez-Santos

doi:10.1007/s13201-024-02320-1

Applied Water Science (Nov 2024)

A machine learning approach to site groundwater contamination monitoring wells

V. Gómez-Escalonilla,
E. Montero-González,
S. Díaz-Alcaide,
M. Martín-Loeches,
M. Rodríguez del Rosario,
P. Martínez-Santos

Affiliations

V. Gómez-Escalonilla: Departamento de Geodinámica, Estratigrafía y Paleontología, Facultad de Ciencias Geológicas, Universidad Complutense de Madrid
E. Montero-González: Departamento de Geodinámica, Estratigrafía y Paleontología, Facultad de Ciencias Geológicas, Universidad Complutense de Madrid
S. Díaz-Alcaide: Departamento de Geodinámica, Estratigrafía y Paleontología, Facultad de Ciencias Geológicas, Universidad Complutense de Madrid
M. Martín-Loeches: Departamento de Geología, Geografía y Medio Ambiente, Unidad Docente de Geología, Universidad de Alcalá
M. Rodríguez del Rosario: Departamento de Geodinámica, Estratigrafía y Paleontología, Facultad de Ciencias Geológicas, Universidad Complutense de Madrid
P. Martínez-Santos: Departamento de Geodinámica, Estratigrafía y Paleontología, Facultad de Ciencias Geológicas, Universidad Complutense de Madrid

DOI: https://doi.org/10.1007/s13201-024-02320-1
Journal volume & issue: Vol. 14, no. 12
pp. 1 – 19

Abstract

Read online

Abstract Effective monitoring of groundwater contamination is crucial to protect human livelihoods and ecosystems. This paper presents a machine learning-based approach to improve groundwater monitoring networks by providing predictions of groundwater contamination in space. The method is demonstrated through a practical application in Central Spain, where nitrate was used as a proxy for groundwater contamination. Predictive mapping identifies the spatial markers for groundwater contamination based on twenty-four predictor variables and a dataset of 213 existing monitoring boreholes. Tree-based algorithms found meaningful associations between the explanatory variables and known nitrate concentrations. Comparing the outcomes of the algorithms with the areas officially delineated as vulnerable to nitrate suggests that machine learning algorithms are able to predict groundwater contamination. The extra trees algorithm outperformed decision trees, random forest, gradient boosting, and AdaBoost classifiers, with an area under the curve score in excess of 0.88. Major predictors for groundwater contamination were depth to the water table, lithology, distance to rivers, and distance to livestock farms. Predictive mapping suggests that there are unmonitored regions to the northeast and to the southwest of Madrid’s metropolitan area that present similar markers to monitored regions known to be contaminated. These unmonitored areas should be prioritized in future attempts to improve the network. From a research perspective, the main conclusion of this work is that machine learning techniques can be used as a technique to automate the siting of monitoring boreholes. Practical applications should nevertheless be overseen by an expert eye to guarantee the quality of the outcomes.

Published in Applied Water Science

ISSN: 2190-5487 (Print); 2190-5495 (Online)
Publisher: SpringerOpen
Country of publisher: Germany
LCC subjects: Technology: Environmental technology. Sanitary engineering: Water supply for domestic and industrial purposes
Website: http://www.springer.com/13201

About the journal

Abstract

Keywords