Remote Sensing (Sep 2024)

Leveraging Machine Learning and Remote Sensing for Water Quality Analysis in Lake Ranco, Southern Chile

  • Lien Rodríguez-López,
  • Lisandra Bravo Alvarez,
  • Iongel Duran-Llacer,
  • David E. Ruíz-Guirola,
  • Samuel Montejo-Sánchez,
  • Rebeca Martínez-Retureta,
  • Ernesto López-Morales,
  • Luc Bourrel,
  • Frédéric Frappart,
  • Roberto Urrutia

DOI
https://doi.org/10.3390/rs16183401
Journal volume & issue
Vol. 16, no. 18
p. 3401

Abstract

Read online

This study examines the dynamics of limnological parameters of a South American lake located in southern Chile with the objective of predicting chlorophyll-a levels, which are a key indicator of algal biomass and water quality, by integrating combined remote sensing and machine learning techniques. Employing four advanced machine learning models (recurrent neural network (RNNs), long short-term memory (LSTM), recurrent gate unit (GRU), and temporal convolutional network (TCNs)), the research focuses on the estimation of chlorophyll-a concentrations at three sampling stations within Lake Ranco. The data span from 1987 to 2020 and are used in three different cases: using only in situ data (Case 1), using in situ and meteorological data (Case 2), using in situ, and meteorological and satellite data from Landsat and Sentinel missions (Case 3). In all cases, each machine learning model shows robust performance, with promising results in predicting chlorophyll-a concentrations. Among these models, LSTM stands out as the most effective, with the best metrics in the estimation, the best performance was Case 1, with R2 = 0.89, an RSME of 0.32 µg/L, an MAE 1.25 µg/L and an MSE 0.25 (µg/L)2, consistently outperforming the others according to the static metrics used for validation. This finding underscores the effectiveness of LSTM in capturing the complex temporal relationships inherent in the dataset. However, increasing the dataset in Case 3 shows a better performance of TCNs (R2 = 0.96; MSE = 0.33 (µg/L)2; RMSE = 0.13 µg/L; and MAE = 0.06 µg/L). The successful application of machine learning algorithms emphasizes their potential to elucidate the dynamics of algal biomass in Lake Ranco, located in the southern region of Chile. These results not only contribute to a deeper understanding of the lake ecosystem but also highlight the utility of advanced computational techniques in environmental research and management.

Keywords