HybridMouse: A Hybrid Convolutional-Recurrent Neural Network-Based Model for Identification of Mouse Ultrasonic Vocalizations

Yizhaq Goussha; Kfir Bar; Shai Netser; Lior Cohen; Yacov Hel-Or; Shlomo Wagner

doi:10.3389/fnbeh.2021.810590

Frontiers in Behavioral Neuroscience (Jan 2022)

HybridMouse: A Hybrid Convolutional-Recurrent Neural Network-Based Model for Identification of Mouse Ultrasonic Vocalizations

Yizhaq Goussha,
Kfir Bar,
Shai Netser,
Lior Cohen,
Yacov Hel-Or,
Shlomo Wagner

Affiliations

Yizhaq Goussha: Sagol Department of Neurobiology, Faculty of Natural Sciences, University of Haifa, Haifa, Israel
Kfir Bar: School of Computer Science, The Interdisciplinary Center, Herzliya, Israel
Shai Netser: Sagol Department of Neurobiology, Faculty of Natural Sciences, University of Haifa, Haifa, Israel
Lior Cohen: Sagol Department of Neurobiology, Faculty of Natural Sciences, University of Haifa, Haifa, Israel
Yacov Hel-Or: School of Computer Science, The Interdisciplinary Center, Herzliya, Israel
Shlomo Wagner: Sagol Department of Neurobiology, Faculty of Natural Sciences, University of Haifa, Haifa, Israel

DOI: https://doi.org/10.3389/fnbeh.2021.810590
Journal volume & issue: Vol. 15

Abstract

Read online

Mice use ultrasonic vocalizations (USVs) to convey a variety of socially relevant information. These vocalizations are affected by the sex, age, strain, and emotional state of the emitter and can thus be used to characterize it. Current tools used to detect and analyze murine USVs rely on user input and image processing algorithms to identify USVs, therefore requiring ideal recording environments. More recent tools which utilize convolutional neural networks models to identify vocalization segments perform well above the latter but do not exploit the sequential structure of audio vocalizations. On the other hand, human voice recognition models were made explicitly for audio processing; they incorporate the advantages of CNN models in recurrent models that allow them to capture the sequential nature of the audio. Here we describe the HybridMouse software: an audio analysis tool that combines convolutional (CNN) and recurrent (RNN) neural networks for automatically identifying, labeling, and extracting recorded USVs. Following training on manually labeled audio files recorded in various experimental conditions, HybridMouse outperformed the most commonly used benchmark model utilizing deep-learning tools in accuracy and precision. Moreover, it does not require user input and produces reliable detection and analysis of USVs recorded under harsh experimental conditions. We suggest that HybrideMouse will enhance the analysis of murine USVs and facilitate their use in scientific research.

Published in Frontiers in Behavioral Neuroscience

ISSN: 1662-5153 (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Medicine: Internal medicine: Neurosciences. Biological psychiatry. Neuropsychiatry
Website: http://www.frontiersin.org/behavioral_neuroscience

About the journal

Abstract

Keywords