Scientific Reports (May 2023)

Machine learning and statistical classification of birdsong link vocal acoustic features with phylogeny

  • Moises Rivera,
  • Jacob A. Edwards,
  • Mark E. Hauber,
  • Sarah M. N. Woolley

DOI
https://doi.org/10.1038/s41598-023-33825-5
Journal volume & issue
Vol. 13, no. 1
pp. 1 – 18

Abstract

Birdsong is a longstanding model system for studying evolution and biodiversity. Here, we collected and analyzed high-quality song recordings from seven species in the family Estrildidae. We measured the acoustic features of syllables and then used dimensionality reduction and machine learning classifiers to identify features that accurately assigned syllables to species. Species differences were captured by the first three principal components, corresponding to basic frequency, power distribution, and spectrotemporal features. We then identified the measured features underlying classification accuracy. We found that fundamental frequency, mean frequency, spectral flatness, and syllable duration were the most informative features for species identification. Next, we tested whether specific acoustic features of species’ songs predicted phylogenetic distance. We found significant phylogenetic signal in syllable frequency features, but not in power distribution or spectrotemporal features. These results suggest that frequency features are more constrained by species’ genetics than are other features, and that they are the best signal features for identifying species from song recordings. The absence of phylogenetic signal in power distribution and spectrotemporal features suggests that these song features are labile, reflecting learning processes and individual recognition.
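The idea of finding which measured features drive species classification can be illustrated with a minimal sketch. It is not the authors' pipeline: the abstract does not name the classifiers used, so a nearest-centroid classifier and a leave-one-feature-out ablation are swapped in here as simple stand-ins. The species labels, the per-species means and standard deviations, and all function names are hypothetical, and the syllables are synthetic rather than taken from the recordings.

```python
import math
import random

# Feature names follow the abstract; everything else below is an assumption.
FEATURES = ["fundamental_frequency", "mean_frequency",
            "spectral_flatness", "duration"]

# Hypothetical (mean, sd) per feature for three made-up species.
SPECIES_PARAMS = {
    "species_A": [(600, 80), (3000, 300), (0.30, 0.10), (0.10, 0.05)],
    "species_B": [(900, 80), (4000, 300), (0.35, 0.10), (0.12, 0.05)],
    "species_C": [(1200, 80), (5000, 300), (0.40, 0.10), (0.14, 0.05)],
}


def make_syllables(n_per_species, rng):
    """Draw synthetic syllables as (feature_vector, species_label) pairs."""
    data = []
    for species, params in SPECIES_PARAMS.items():
        for _ in range(n_per_species):
            data.append(([rng.gauss(m, s) for m, s in params], species))
    return data


def zscore_params(rows):
    """Return per-feature means and standard deviations of the rows."""
    cols = list(zip(*rows))
    means = [sum(c) / len(c) for c in cols]
    sds = [math.sqrt(sum((v - m) ** 2 for v in c) / len(c))
           for c, m in zip(cols, means)]
    return means, sds


def accuracy(train, test, keep):
    """Nearest-centroid accuracy using only the feature indices in `keep`."""
    means, sds = zscore_params([x for x, _ in train])

    def scale(x):
        # Standardize with training statistics so features are comparable.
        return [(x[i] - means[i]) / sds[i] for i in keep]

    groups = {}
    for x, label in train:
        groups.setdefault(label, []).append(scale(x))
    centroids = {label: [sum(c) / len(c) for c in zip(*rows)]
                 for label, rows in groups.items()}

    correct = 0
    for x, label in test:
        z = scale(x)
        guess = min(centroids, key=lambda s: math.dist(z, centroids[s]))
        correct += guess == label
    return correct / len(test)


def feature_ablation(train, test):
    """Return baseline accuracy and the accuracy lost when each feature is dropped."""
    all_idx = list(range(len(FEATURES)))
    baseline = accuracy(train, test, all_idx)
    drops = {}
    for i, name in enumerate(FEATURES):
        reduced = [j for j in all_idx if j != i]
        drops[name] = baseline - accuracy(train, test, reduced)
    return baseline, drops


rng = random.Random(0)          # fixed seed so the run is repeatable
train = make_syllables(100, rng)
test = make_syllables(60, rng)
baseline, drops = feature_ablation(train, test)
print(f"accuracy with all features: {baseline:.3f}")
for name, drop in sorted(drops.items(), key=lambda kv: -kv[1]):
    print(f"accuracy lost without {name}: {drop:+.3f}")
```

A larger loss in accuracy when a feature is removed marks that feature as more informative for this toy classifier. With real recordings, the ranking would depend on the measured data and on the classifier chosen.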