Frontiers in Environmental Science (Nov 2024)
Spatiotemporal pattern of water hyacinth (Pontederia crassipes) distribution in Lake Tana, Ethiopia, using a random forest machine learning model
Abstract
Water hyacinth (Pontederia crassipes) is an invasive weed that covers a significant portion of Lake Tana. The infestation has an impact on the lake’s ecological and socioeconomic systems. Early detection of the spread of water hyacinth using geospatial techniques is crucial for its effective management and control. The main objective of this study was to examine the spatiotemporal distribution of water hyacinth from 2016 to 2022 using a random forest machine learning model. The study used 16 variables obtained from Sentinel-2A, Sentinel-1 SAR, and SRTM DEM, and a random forest supervised classification model was applied. Seven spectral indices, five spectral bands, two Sentinel-1 SAR bands, and two topographic variables were used in combination to model the spatial distribution of water hyacinth. The model was evaluated using the overall accuracy and kappa coefficient. The findings demonstrated that the overall accuracy ranged from 0.91 to 0.94 and kappa coefficient from 0.88 to 0.92 in the wet season and 0.93 to 0.95 and 0.90 to 0.93 in the dry season, respectively. B11 and B5 (2022), VH, soil adjusted vegetation index (SAVI), and normalized difference water index (NDWI) (2020), B5 and B12 (2018), and VH and slope (2016) are the highly important variables in the classification. The study found that the spatial coverage of water hyacinth was 686.5 and 650.4 ha (2016), 1,851 and 1,259 ha (2018), 1,396.7 and 1,305.7 ha (2020), and 1,436.5 and 1,216.5 ha (2022) in the wet and dry seasons, respectively. The research findings indicate that variables derived from optical (Sentinel-2A and SRTM) and non-optical (Sentinel-1 SAR) satellite imagery effectively identify water hyacinth and display its spatiotemporal spread using the random forest machine learning algorithm.
Keywords