Semantic Segmentation of Aerial Imagery Using U-Net with Self-Attention and Separable Convolutions

Bakht Alam Khan; Jin-Woo Jung

doi:10.3390/app14093712

Applied Sciences (Apr 2024)

Affiliations

Bakht Alam Khan: Department of Computer Science and Engineering, Dongguk University, Seoul 04620, Republic of Korea
Jin-Woo Jung: Department of Computer Science and Engineering, Dongguk University, Seoul 04620, Republic of Korea

DOI: https://doi.org/10.3390/app14093712
Journal volume & issue: Vol. 14, no. 9, p. 3712

Abstract

This research aims to improve the accuracy of semantic segmentation of aerial imagery, a task essential for applications such as urban planning and environmental monitoring. The study uses the Intersection over Union (IoU) score as an evaluation metric and employs the Patchify library, with a patch size of 256, to augment the dataset, which is subsequently split into training and testing sets. The core of the investigation is a novel architecture that combines a U-Net framework with self-attention mechanisms and separable convolutions. The self-attention mechanisms enhance the model’s understanding of image context, while the separable convolutions speed up training, contributing to overall efficiency. The proposed model surpasses the previous state-of-the-art Dense Plus U-Net, achieving an accuracy of 91% compared to the former’s 86%. Visual comparisons of original image patches, original mask patches, and predicted mask patches illustrate the model’s segmentation quality and underscore the value of these architectural elements for accuracy and efficiency in aerial image analysis.
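
The abstract names three concrete ingredients: cutting images into 256-pixel patches, scoring with IoU, and replacing standard convolutions with separable ones. The following is a minimal illustrative sketch of each in plain Python, not the authors' code. It does not call the Patchify library; the function names (split_into_patches, iou_score, conv_weight_counts) are hypothetical, and dropping edge pixels that do not fill a whole patch is an assumption made here for simplicity.

```python
from typing import List, Sequence, Tuple

Grid = List[List[int]]


def split_into_patches(image: Sequence[Sequence[int]], patch_size: int = 256) -> List[Grid]:
    """Tile a 2-D grid into non-overlapping patch_size x patch_size patches.

    Assumption: rows and columns that do not fill a whole patch are dropped.
    """
    height = len(image)
    width = len(image[0]) if height else 0
    patches: List[Grid] = []
    for top in range(0, height - patch_size + 1, patch_size):
        for left in range(0, width - patch_size + 1, patch_size):
            patch = [list(row[left:left + patch_size])
                     for row in image[top:top + patch_size]]
            patches.append(patch)
    return patches


def iou_score(true_mask: Sequence[Sequence[int]],
              pred_mask: Sequence[Sequence[int]],
              class_id: int) -> float:
    """Intersection over Union for one class label across two label masks."""
    intersection = 0
    union = 0
    for true_row, pred_row in zip(true_mask, pred_mask):
        for t, p in zip(true_row, pred_row):
            in_true = t == class_id
            in_pred = p == class_id
            intersection += in_true and in_pred
            union += in_true or in_pred
    # Convention assumed here: a class absent from both masks scores 1.0.
    return intersection / union if union else 1.0


def conv_weight_counts(kernel: int, in_ch: int, out_ch: int) -> Tuple[int, int]:
    """Weight counts (biases ignored) for a standard vs. depthwise-separable conv."""
    standard = kernel * kernel * in_ch * out_ch
    # Depthwise step: one kernel x kernel filter per input channel,
    # followed by a 1 x 1 pointwise step that mixes channels.
    separable = kernel * kernel * in_ch + in_ch * out_ch
    return standard, separable
```

For a 3 x 3 layer with 64 input and 128 output channels, conv_weight_counts gives 73,728 weights for the standard convolution against 8,768 for the separable one, which is the kind of reduction that plausibly underlies the faster training the abstract reports.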

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci
