Methods in Ecology and Evolution (Jan 2024)

ECOGEN: Bird sounds generation using deep learning

  • Axel‐Christian Guei,
  • Sylvain Christin,
  • Nicolas Lecomte,
  • Éric Hervet

DOI
https://doi.org/10.1111/2041-210X.14239
Journal volume & issue
Vol. 15, no. 1
pp. 69 – 79

Abstract

Read online

Abstract Large‐scale acoustic projects generate vast amounts of data that can now be efficiently processed using deep learning tools. However, these tools often face limitations due to sound labeling and imbalanced sampling. Data augmentation can help overcome such challenges, particularly through the generation of synthetic and lifelike sounds. Synthetic samples can be valuable not only for deep learning but also for species with limited available data. Despite advancements in computer power, sound generation remains a time‐consuming process, even requiring a substantial number of samples. We present ECOGEN, a novel deep learning approach designed to generate realistic bird songs for biologists and ecologists. The primary objective of ECOGEN is to enhance the number of samples in under‐represented bird song classes, thereby improving the performance and robustness of classifiers in ecological research.The ECOGEN framework employs spectrograms as a representation of bird songs and leverages proven image generation techniques to create new spectrograms, subsequently converted back to digital audio signals. As a class‐agnostic tool, ECOGEN is applicable to a wide range of biophonic sounds, including mammal and insect calls. We show that adding samples generated by ECOGEN to a bird song classifier improved the classification accuracy by 12% on average and improved results compared with classic data augmentation techniques 80% of the time. Our approach is both fast and efficient, enabling the generation of synthetic bird songs on standard computing resources. By facilitating the creation of synthetic bird songs, ECOGEN can contribute to the conservation of endangered bird species, while providing valuable insights into their vocalizations, behaviours and habitat preferences. Future development of ECOGEN can be easily implemented and could focus on incorporating additional configurable parameters during the generation phase for increased control over the output, catering to the specific needs of biologists.

Keywords