Frontiers in Marine Science (Jun 2023)

Demystifying image-based machine learning: a practical guide to automated analysis of field imagery using modern machine learning tools

  • Byron T. Belcher,
  • Eliana H. Bower,
  • Benjamin Burford,
  • Benjamin Burford,
  • Maria Rosa Celis,
  • Maria Rosa Celis,
  • Ashkaan K. Fahimipour,
  • Ashkaan K. Fahimipour,
  • Ashkaan K. Fahimipour,
  • Isabela L. Guevara,
  • Kakani Katija,
  • Zulekha Khokhar,
  • Anjana Manjunath,
  • Samuel Nelson,
  • Samuel Nelson,
  • Simone Olivetti,
  • Simone Olivetti,
  • Eric Orenstein,
  • Mohamad H. Saleh,
  • Brayan Vaca,
  • Salma Valladares,
  • Stella A. Hein,
  • Stella A. Hein,
  • Andrew M. Hein,
  • Andrew M. Hein

DOI
https://doi.org/10.3389/fmars.2023.1157370
Journal volume & issue
Vol. 10

Abstract

Read online

Image-based machine learning methods are becoming among the most widely-used forms of data analysis across science, technology, engineering, and industry. These methods are powerful because they can rapidly and automatically extract rich contextual and spatial information from images, a process that has historically required a large amount of human labor. A wide range of recent scientific applications have demonstrated the potential of these methods to change how researchers study the ocean. However, despite their promise, machine learning tools are still under-exploited in many domains including species and environmental monitoring, biodiversity surveys, fisheries abundance and size estimation, rare event and species detection, the study of animal behavior, and citizen science. Our objective in this article is to provide an approachable, end-to-end guide to help researchers apply image-based machine learning methods effectively to their own research problems. Using a case study, we describe how to prepare data, train and deploy models, and overcome common issues that can cause models to underperform. Importantly, we discuss how to diagnose problems that can cause poor model performance on new imagery to build robust tools that can vastly accelerate data acquisition in the marine realm. Code to perform analyses is provided at https://github.com/heinsense2/AIO_CaseStudy.

Keywords