IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (Jan 2024)
CTMANet: A CNN-Transformer Hybrid Semantic Segmentation Network for Fine-Grained Airport Extraction in Complex SAR Scenes
Abstract
Airports represent essential infrastructure, offering substantial research and application potential. However, extracting airports from complex synthetic aperture radar (SAR) scenes is challenging due to the cluttered background and fine structure of airports. This necessitates the integration of global and local information for fine-grained extraction. To tackle this issue, this article introduces a novel framework for fine-grained extraction of airports from large-scale SAR images. First, a convolutional neural networks (CNN) transformer hybrid semantic segmentation network with multiscale contextual fusion is proposed, named CNN-transformer network (CTMANet). In this network, the encoder combines CNNs and transformers to capture local and global information, while the multiscale context aggregation block fuses multiscale contextual information. Skip connections between the encoder and decoder are established to minimize the loss of detailed information and fuse low-level features with high-level semantic features. Moreover, a category balance block is designed to address class imbalance. Experimental results on the GF-3 dataset demonstrate that CTMANet outperforms state-of-the-art methods, proving its superior suitability for fine-grained airport extraction in large-scale scenarios.
Keywords