On Improving the Training of Models for the Semantic Segmentation of Benthic Communities from Orthographic Imagery

Gaia Pavoni; Massimiliano Corsini; Marco Callieri; Giuseppe Fiameni; Clinton Edwards; Paolo Cignoni

doi:10.3390/rs12183106

Remote Sensing (Sep 2020)

On Improving the Training of Models for the Semantic Segmentation of Benthic Communities from Orthographic Imagery

Gaia Pavoni,
Massimiliano Corsini,
Marco Callieri,
Giuseppe Fiameni,
Clinton Edwards,
Paolo Cignoni

Affiliations

Gaia Pavoni: Visual Computing Lab (ISTI-CNR), 56124 Pisa, Italy
Massimiliano Corsini: Visual Computing Lab (ISTI-CNR), 56124 Pisa, Italy
Marco Callieri: Visual Computing Lab (ISTI-CNR), 56124 Pisa, Italy
Giuseppe Fiameni: NVIDIA AI Technology Centre (NVAITC), 40134 Bologna, Italy
Clinton Edwards: Scripps Institution of Oceanography, UC San Diego, La Jolla, CA 92037, USA
Paolo Cignoni: Visual Computing Lab (ISTI-CNR), 56124 Pisa, Italy

DOI: https://doi.org/10.3390/rs12183106
Journal volume & issue: Vol. 12, no. 18
p. 3106

Abstract

Read online

The semantic segmentation of underwater imagery is an important step in the ecological analysis of coral habitats. To date, scientists produce fine-scale area annotations manually, an exceptionally time-consuming task that could be efficiently automatized by modern CNNs. This paper extends our previous work presented at the 3DUW’19 conference, outlining the workflow for the automated annotation of imagery from the first step of dataset preparation, to the last step of prediction reassembly. In particular, we propose an ecologically inspired strategy for an efficient dataset partition, an over-sampling methodology targeted on ortho-imagery, and a score fusion strategy. We also investigate the use of different loss functions in the optimization of a Deeplab V3+ model, to mitigate the class-imbalance problem and improve prediction accuracy on coral instance boundaries. The experimental results demonstrate the effectiveness of the ecologically inspired split in improving model performance, and quantify the advantages and limitations of the proposed over-sampling strategy. The extensive comparison of the loss functions gives numerous insights on the segmentation task; the Focal Tversky, typically used in the context of medical imaging (but not in remote sensing), results in the most convenient choice. By improving the accuracy of automated ortho image processing, the results presented here promise to meet the fundamental challenge of increasing the spatial and temporal scale of coral reef research, allowing researchers greater predictive ability to better manage coral reef resilience in the context of a changing environment.

Published in Remote Sensing

ISSN: 2072-4292 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science
Website: http://www.mdpi.com/journal/remotesensing/

About the journal

Abstract

Keywords