Sound Event Detection for Human Safety and Security in Noisy Environments

Michael Neri; Federica Battisti; Alessandro Neri; Marco Carli

doi:10.1109/ACCESS.2022.3231681

IEEE Access (Jan 2022)

Sound Event Detection for Human Safety and Security in Noisy Environments

Michael Neri,
Federica Battisti,
Alessandro Neri,
Marco Carli

Affiliations

Michael Neri: ORCiD; Department of Industrial, Electronic and Mechanical Engineering, Roma Tre University, Rome, Italy
Federica Battisti: ORCiD; Department of Information Engineering, University of Padova, Padua, Italy
Alessandro Neri: ORCiD; Department of Industrial, Electronic and Mechanical Engineering, Roma Tre University, Rome, Italy
Marco Carli: ORCiD; Department of Industrial, Electronic and Mechanical Engineering, Roma Tre University, Rome, Italy

DOI: https://doi.org/10.1109/ACCESS.2022.3231681
Journal volume & issue: Vol. 10
pp. 134230 – 134240

Abstract

Read online

The objective of a sound event detector is to recognize anomalies in an audio clip and return their onset and offset. However, detecting sound events in noisy environments is a challenging task. This is due to the fact that in a real audio signal several sound sources co-exist. Moreover, the characteristics of polyphonic audios are different from isolated recordings. It is also necessary to consider the presence of noise (e.g. thermal and environmental). In this contribution, we present a sound anomaly detection system based on a fully convolutional network which exploits image spatial filtering and an Atrous Spatial Pyramid Pooling module. To cope with the lack of datasets specifically designed for sound event detection, a dataset for the specific application of noisy bus environments has been designed. The dataset has been obtained by mixing background audio files, recorded in a real environment, with anomalous events extracted from monophonic collections of labelled audios. The performances of the proposed system have been evaluated through segment-based metrics such as error rate, recall, and F1-Score. Moreover, robustness and precision have been evaluated through four different tests. The analysis of the results shows that the proposed sound event detector outperforms both state-of-the-art methods and general purpose deep learning-solutions.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords