Scientific Reports (Jul 2022)
Machine learning approach towards explaining water quality dynamics in an urbanised river
Abstract
Human activities alter river water quality and quantity, with consequences for the ecosystems of urbanised rivers. Quantifying the role of human-induced drivers in controlling spatio-temporal patterns in water quality is critical to developing successful strategies for improving the ecological health of urban rivers. Here, we analyse high-frequency electrical conductivity and temperature data collected from the River Chess in South-East England during a Citizen Science project. Using machine learning, we find that boosted trees outperform generalised additive models (GAM) and accurately describe water quality dynamics with less than 1% error. SHapley Additive exPlanations (SHAP) reveal the importance of individual variables, such as river level and Wastewater Treatment Works (WWTW) outflow, and the (inter)dependencies between them. WWTW outflows give rise to diurnal variations in electrical conductivity, which are detectable throughout the year, and to an increase in average water temperature of 1 °C in a 2 km reach downstream of the WWTW during low flows. Overall, we showcase how high-frequency water quality measurements initiated by a Citizen Science project, together with machine learning techniques, can help untangle key drivers of water quality dynamics in an urbanised chalk stream.