Aerial Fluvial Image Dataset for Deep Semantic Segmentation Neural Networks and Its Benchmarks

Zihan Wang; Nina Mahmoudian

doi:10.1109/JSTARS.2023.3275068

IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (Jan 2023)

Aerial Fluvial Image Dataset for Deep Semantic Segmentation Neural Networks and Its Benchmarks

Zihan Wang,
Nina Mahmoudian

Affiliations

Zihan Wang: ORCiD; School of Mechanical Engineering, Purdue University, West Lafayette, IN, USA
Nina Mahmoudian: ORCiD; School of Mechanical Engineering, Purdue University, West Lafayette, IN, USA

DOI: https://doi.org/10.1109/JSTARS.2023.3275068
Journal volume & issue: Vol. 16
pp. 4755 – 4766

Abstract

Read online

Classification of aerial imagery is essential for water channel surveillance and waterfront land cover characterization. It is also beneficial to long-duration collaborative autonomous navigation of both unmanned aerial vehicles (UAVs) and autonomous surface vehicles (ASVs) to fulfill unmanned hydrologic data collection, environmental inspection, and disaster warning tasks. Deep semantic segmentation networks trained on aerial imagery have shown great results, however, they require finely labeled data. Existing aerial image datasets contain mostly urban scenes or fluvial images taken from ground level or collected from the Internet, there are no datasets that incorporate aerial and fluvial scenes with detailed annotation from different perspectives or include waterborne obstacles. To tackle this problem, aerial fluvial image dataset (AFID) is presented with multiple camera perspectives of fluvial scenes and is semantically labeled with emphasis on water and waterborne obstacles. Deep neural networks for binary (water and nonwater) semantic segmentation, with 12 different combinations of five encoders and three decoding architectures, are trained and tested in a curriculum learning scheme. Model performance is benchmarked on AFID, and the accuracy-efficiency tradeoff is discussed with the conclusion that the Unet architecture with a mix transformer encoder achieves the best segmentation performance with moderate computational consumption. The AFID dataset is publicly available to facilitate future work on developing new lightweight semantic segmentation models. Our immediate future plan will focus on the coordination of air and surface-water autonomous systems for navigable water detection and obstacle avoidance in high-risk challenging environments.

Published in IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing

ISSN: 1939-1404 (Print); 2151-1535 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Ocean engineering; Science: Physics: Geophysics. Cosmic physics
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=4609443

About the journal

Abstract

Keywords