Scientific Reports (Jun 2023)

A collaborative and near-comprehensive North Pacific humpback whale photo-ID dataset

  • Ted Cheeseman,
  • Ken Southerland,
  • Jo Marie Acebes,
  • Katherina Audley,
  • Jay Barlow,
  • Lars Bejder,
  • Caitlin Birdsall,
  • Amanda L. Bradford,
  • Josie K. Byington,
  • John Calambokidis,
  • Rachel Cartwright,
  • Jen Cedarleaf,
  • Andrea Jacqueline García Chavez,
  • Jens J. Currie,
  • Joëlle De Weerdt,
  • Nicole Doe,
  • Thomas Doniol-Valcroze,
  • Karina Dracott,
  • Olga Filatova,
  • Rachel Finn,
  • Kiirsten Flynn,
  • John K. B. Ford,
  • Astrid Frisch-Jordán,
  • Christine M. Gabriele,
  • Beth Goodwin,
  • Craig Hayslip,
  • Jackie Hildering,
  • Marie C. Hill,
  • Jeff K. Jacobsen,
  • M. Esther Jiménez-López,
  • Meagan Jones,
  • Nozomi Kobayashi,
  • Edward Lyman,
  • Mark Malleson,
  • Evgeny Mamaev,
  • Pamela Martínez Loustalot,
  • Annie Masterman,
  • Craig Matkin,
  • Christie J. McMillan,
  • Jeff E. Moore,
  • John R. Moran,
  • Janet L. Neilson,
  • Hayley Newell,
  • Haruna Okabe,
  • Marilia Olio,
  • Adam A. Pack,
  • Daniel M. Palacios,
  • Heidi C. Pearson,
  • Ester Quintana-Rizzo,
  • Raul Fernando Ramírez Barragán,
  • Nicola Ransome,
  • Hiram Rosales-Nanduca,
  • Fred Sharpe,
  • Tasli Shaw,
  • Stephanie H. Stack,
  • Iain Staniland,
  • Jan Straley,
  • Andrew Szabo,
  • Suzie Teerlink,
  • Olga Titova,
  • Jorge Urban R.,
  • Martin van Aswegen,
  • Marcel Vinicius de Morais,
  • Olga von Ziegesar,
  • Briana Witteveen,
  • Janie Wray,
  • Kymberly M. Yano,
  • Denny Zwiefelhofer,
  • Phil Clapham

DOI
https://doi.org/10.1038/s41598-023-36928-1
Journal volume & issue
Vol. 13, no. 1
pp. 1 – 17

Abstract

We present an ocean-basin-scale dataset that includes tail fluke photographic identification (photo-ID) and encounter data for most living individual humpback whales (Megaptera novaeangliae) in the North Pacific Ocean. The dataset was built through a broad collaboration combining 39 separate curated photo-ID catalogs, supplemented with community science data. Data from throughout the North Pacific were aggregated into 13 regions, including six breeding regions, six feeding regions, and one migratory corridor. All images were compared with minimal pre-processing using a recently developed image recognition algorithm based on machine learning through artificial intelligence; this system is capable of rapidly detecting matches between individuals with an estimated 97–99% accuracy. For the 2001–2021 study period, a total of 27,956 unique individuals were documented in 157,350 encounters. Each individual was encountered, on average, in 5.6 sampling periods (i.e., breeding and feeding seasons), with an annual average of 87% of whales encountered in more than one season. The combined dataset and image recognition tool represent a living and accessible resource for collaborative, basin-wide studies of a keystone marine mammal in a time of rapid ecological change.