IEEE Access (Jan 2024)

Convolutional Neural Networks for Enhancing Detection of Dolphin Whistles in a Dense Acoustic Environment

  • David Scaradozzi,
  • Rocco de Marco,
  • Daniel Li Veli,
  • Alessandro Lucchetti,
  • Laura Screpanti,
  • Francesco Di Nardo

DOI
https://doi.org/10.1109/ACCESS.2024.3454815
Journal volume & issue
Vol. 12
pp. 127141 – 127148

Abstract

Read online

Latest developments in acoustic research suggest that using surveying methods based on artificial intelligence (AI) could improve the effectiveness of underwater monitoring. Passive acoustic monitoring (PAM) has proven to be a cost-effective approach for gathering information about the acoustic behavior of dolphins and plays a crucial role in studying their vocalizations, particularly whistles. This study investigates the efficiency of a binary convolutional neural network (CNN) in detecting dolphin whistles amidst high-density vocalizations in an aquatic environment. Specifically, this analysis intends to determine whether a properly trained CNN can recognize a single whistle even in challenging condition, including situations where multiple dolphins vocalize simultaneously, resulting in overlapping whistles that may have different shapes and durations. To this aim, experimental trials were conducted at Oltremare marine park, Riccione, Italy, where underwater recordings of seven-dolphin vocalizations were collected over 22 consecutive hours. The CNN was trained on labeled whistle spectrograms. The model, comprising three convolutional layers followed by max pooling layers and rectified linear unit (ReLU) activation functions, was evaluated using a 10-fold cross-validation approach. Confusion matrix and performance metrics indicate that the proposed approach achieves results comparable to those reported in the literature, despite the more challenging working conditions. The study supports the potential of AI models in enhancing passive acoustic monitoring techniques.

Keywords