IEEE Access (Jan 2022)

A Large-Scale Benchmark Dataset for Anomaly Detection and Rare Event Classification for Audio Forensics

  • Ahmed Abbasi,
  • Abdul Rehman Rehman Javed,
  • Amanullah Yasin,
  • Zunera Jalil,
  • Natalia Kryvinska,
  • Usman Tariq

DOI
https://doi.org/10.1109/ACCESS.2022.3166602
Journal volume & issue
Vol. 10
pp. 38885 – 38894

Abstract

Read online

With the emergence of new digital technologies, a significant surge has been seen in the volume of multimedia data generated from various smart devices. Several challenges for data analysis have emerged to extract useful information from multimedia data. One such challenge is the early and accurate detection of anomalies in multimedia data. This study proposes an efficient technique for anomaly detection and classification of rare events in audio data. In this paper, we develop a vast audio dataset containing seven different rare events (anomalies) with 15 different background environmental settings (e.g., beach, restaurant, and train) to focus on both detection of anomalous audio and classification of rare sound (e.g., events—baby cry, gunshots, broken glasses, footsteps) events for audio forensics. The proposed approach uses the supreme feature extraction technique by extracting mel-frequency cepstral coefficients (MFCCs) features from the audio signals of the newly created dataset and selects the minimum number of best-performing features for optimum performance using principal component analysis (PCA). These features are input to state-of-the-art machine learning algorithms for performance analysis. We also apply machine learning algorithms to the state-of-the-art dataset and realize good results. Experimental results reveal that the proposed approach effectively detects all anomalies and superior performance to existing approaches in all environments and cases.

Keywords