Machine Learning Undercounts Reproductive Organs on Herbarium Specimens but Accurately Derives Their Quantitative Phenological Status: A Case Study of <i>Streptanthus tortuosus</i>

Natalie L. R. Love; Pierre Bonnet; Hervé Goëau; Alexis Joly; Susan J. Mazer

doi:10.3390/plants10112471

Plants (Nov 2021)

Affiliations

Natalie L. R. Love: Department of Ecology, Evolution, and Marine Biology, University of California, Santa Barbara, CA 93106, USA
Pierre Bonnet: Botany and Modeling of Plant Architecture and Vegetation (AMAP), French Agricultural Research Centre for International Development (CIRAD), French National Centre for Scientific Research (CNRS), French National Institute for Agriculture, Food and Environment (INRAE), Research Institute for Development (IRD), University of Montpellier, 34398 Montpellier, France
Hervé Goëau: Botany and Modeling of Plant Architecture and Vegetation (AMAP), French Agricultural Research Centre for International Development (CIRAD), French National Centre for Scientific Research (CNRS), French National Institute for Agriculture, Food and Environment (INRAE), Research Institute for Development (IRD), University of Montpellier, 34398 Montpellier, France
Alexis Joly: ZENITH Team, Laboratory of Informatics, Robotics and Microelectronics-Joint Research Unit, Institut National de Recherche en Informatique et en Automatique (INRIA) Sophia-Antipolis, CEDEX 5, 34095 Montpellier, France
Susan J. Mazer: Department of Ecology, Evolution, and Marine Biology, University of California, Santa Barbara, CA 93106, USA

DOI: https://doi.org/10.3390/plants10112471
Journal volume & issue: Vol. 10, no. 11, p. 2471

Abstract

Machine learning (ML) can accelerate the extraction of phenological data from herbarium specimens; however, no studies have assessed whether ML-derived phenological data can be used reliably to evaluate ecological patterns. In this study, 709 herbarium specimens representing a widespread annual herb, <i>Streptanthus tortuosus</i>, were scored both manually by human observers and by a Mask R-CNN object detection model to (1) evaluate the concordance between ML-derived and manually derived phenological data and (2) determine whether ML-derived data can be used to reliably assess phenological patterns. The ML model generally underestimated the number of reproductive structures present on each specimen; however, when these counts were used to provide a quantitative estimate of the phenological stage of plants on a given sheet (i.e., the phenological index, or PI), the ML-derived and manually derived PIs were highly concordant. Moreover, herbarium specimen age had no effect on the estimated PI of a given sheet. Finally, including ML-derived PIs as predictor variables in phenological models produced estimates of the phenological sensitivity of this species to climate, temporal shifts in flowering time, and the rate of phenological progression that are indistinguishable from those produced by models based on data provided by human observers. This study demonstrates that phenological data extracted using machine learning can be used reliably to estimate the phenological stage of herbarium specimens and to detect phenological patterns.

Published in Plants

ISSN: 2223-7747 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Botany
Website: http://www.mdpi.com/journal/plants
