Frontiers in Marine Science (May 2023)

An iterative labeling method for annotating marine life imagery

  • Zhiyong Zhang,
  • Pushyami Kaveti,
  • Hanumant Singh,
  • Abigail Powell,
  • Erica Fruh,
  • M. Elizabeth Clarke

DOI
https://doi.org/10.3389/fmars.2023.1094190
Journal volume & issue
Vol. 10

Abstract

Read online

This paper presents a labeling methodology for marine life data using a weakly supervised learning framework. The methodology iteratively trains a deep learning model using non-expert labels obtained from crowdsourcing. This approach enables us to converge on a labeled image dataset through multiple training and production loops that leverage crowdsourcing interfaces. We present our algorithm and its results on two separate sets of image data collected using the Seabed autonomous underwater vehicle. The first dataset consists of 10,505 images that were point annotated by NOAA biologists. This dataset allows us to validate the accuracy of our labeling process. We also apply our algorithm and methodology to a second dataset consisting of 3,968 completely unlabeled images. These image categories are challenging to label, such as sponges. Qualitatively, our results indicate that training with a tiny subset and iterating on those results allows us to converge to a large, highly annotated dataset with a small number of iterations. To demonstrate the effectiveness of our methodology quantitatively, we tabulate the mean average precision (mAP) of the model as the number of iterations increases.

Keywords