U-Net Ensemble for Enhanced Semantic Segmentation in Remote Sensing Imagery

Ivica Dimitrovski; Vlatko Spasev; Suzana Loshkovska; Ivan Kitanovski

doi:10.3390/rs16122077

Remote Sensing (Jun 2024)

U-Net Ensemble for Enhanced Semantic Segmentation in Remote Sensing Imagery

Ivica Dimitrovski,
Vlatko Spasev,
Suzana Loshkovska,
Ivan Kitanovski

Affiliations

Ivica Dimitrovski: Faculty of Computer Science and Engineering, University Ss Cyril and Methodius, 1000 Skopje, North Macedonia
Vlatko Spasev: Faculty of Computer Science and Engineering, University Ss Cyril and Methodius, 1000 Skopje, North Macedonia
Suzana Loshkovska: Faculty of Computer Science and Engineering, University Ss Cyril and Methodius, 1000 Skopje, North Macedonia
Ivan Kitanovski: Faculty of Computer Science and Engineering, University Ss Cyril and Methodius, 1000 Skopje, North Macedonia

DOI: https://doi.org/10.3390/rs16122077
Journal volume & issue: Vol. 16, no. 12
p. 2077

Abstract

Read online

Semantic segmentation of remote sensing imagery stands as a fundamental task within the domains of both remote sensing and computer vision. Its objective is to generate a comprehensive pixel-wise segmentation map of an image, assigning a specific label to each pixel. This facilitates in-depth analysis and comprehension of the Earth’s surface. In this paper, we propose an approach for enhancing semantic segmentation performance by employing an ensemble of U-Net models with three different backbone networks: Multi-Axis Vision Transformer, ConvFormer, and EfficientNet. The final segmentation maps are generated through a geometric mean ensemble method, leveraging the diverse representations learned by each backbone network. The effectiveness of the base U-Net models and the proposed ensemble is evaluated on multiple datasets commonly used for semantic segmentation tasks in remote sensing imagery, including LandCover.ai, LoveDA, INRIA, UAVid, and ISPRS Potsdam datasets. Our experimental results demonstrate that the proposed approach achieves state-of-the-art performance, showcasing its effectiveness and robustness in accurately capturing the semantic information embedded within remote sensing images.

Published in Remote Sensing

ISSN: 2072-4292 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science
Website: http://www.mdpi.com/journal/remotesensing/

About the journal

Abstract

Keywords