IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (Jan 2024)
AEDNet: An Attention-Based Encoder–Decoder Network for Urban Water Extraction From High Spatial Resolution Remote Sensing Images
Abstract
Accurate water extraction from urban remote sensing images is important for formulating river and lake management policies and for ensuring the sustainable development of urban water resources. However, urban high-resolution remote sensing images contain complex spatial and semantic information, so water body features extracted from local information can disagree with those extracted from global information, which reduces the accuracy of urban water extraction. To tackle this issue, an attention-based encoder–decoder network (AEDNet) was proposed. In this network, a backbone employing atrous convolution (AC) acquired low-level and high-level features of urban remote sensing images at various scales. Integrated with the attention mechanism, the encoder–decoder structure extracted global features in both the spatial and channel domains. These two types of features, the multiscale backbone features and the global attention features, were then merged to yield the urban water segmentation. Moreover, a joint loss function (JLF) that considers both intersection over union and class weights was introduced to further enhance the accuracy of urban water extraction. Experimental results demonstrated the strong performance of the proposed method on both the GID and LoveDA datasets.
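The abstract does not give the exact form of the joint loss function, so the following is only a minimal sketch of one plausible reading: a class-weighted binary cross-entropy term combined with a soft intersection-over-union term for the water class. The function names, the mixing factor `alpha`, and the weights `w_water` and `w_background` are illustrative assumptions, not values or definitions taken from the paper.

```python
import math


def weighted_bce(probs, labels, w_water=1.0, w_background=1.0, eps=1e-7):
    """Class-weighted binary cross-entropy, averaged over pixels.

    probs  : predicted water probabilities in [0, 1], one per pixel
    labels : ground-truth labels, 1 for water and 0 for background
    The weights are hypothetical; they let the rarer class count more.
    """
    total = 0.0
    for p, y in zip(probs, labels):
        p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
        if y == 1:
            total += -w_water * math.log(p)
        else:
            total += -w_background * math.log(1.0 - p)
    return total / len(probs)


def soft_iou_loss(probs, labels, eps=1e-7):
    """One minus the soft (differentiable) IoU of the water class."""
    intersection = sum(p * y for p, y in zip(probs, labels))
    union = sum(p + y - p * y for p, y in zip(probs, labels))
    return 1.0 - (intersection + eps) / (union + eps)


def joint_loss(probs, labels, alpha=0.5, w_water=2.0, w_background=1.0):
    """Assumed joint loss: alpha * weighted BCE + (1 - alpha) * IoU loss."""
    ce_term = weighted_bce(probs, labels, w_water, w_background)
    iou_term = soft_iou_loss(probs, labels)
    return alpha * ce_term + (1.0 - alpha) * iou_term


if __name__ == "__main__":
    labels = [1, 0, 1, 0]
    print(joint_loss([0.9, 0.1, 0.8, 0.2], labels))  # mostly correct prediction
    print(joint_loss([0.1, 0.9, 0.2, 0.8], labels))  # mostly wrong prediction
```

In this sketch the cross-entropy term penalizes per-pixel errors, with missed water pixels weighted more heavily, while the IoU term rewards overlap of the predicted and true water regions as a whole. In practice the same idea would be written over tensors in a deep learning framework rather than Python lists.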
Keywords