Machine Learning with Applications (Dec 2024)
Trends in audio scene source counting and analysis
Abstract
Audio scene analysis involves a variety of tasks to obtain information from an audio environment. Audio source counting is one such task that has implications to many other aspects of audio analysis, yet it is relatively unexplored. This work presents the first review of the audio source counting literature and aims to convey the significance of this task to the wider domain of audio analysis. We identify and discuss connections between audio source counting and other more commonly studied audio analysis tasks. In addition, a review of the publicly available audio datasets is presented, highlighting the lack of datasets geared towards audio source counting. Our goal of this review paper is to promote future research of audio source counting.