An Overview of Audio Event Detection Methods from Feature Extraction to Classification

Elham Babaee; Nor Badrul Anuar; Ainuddin Wahid Abdul Wahab; Shahaboddin Shamshirband; Anthony T. Chronopoulos

doi:10.1080/08839514.2018.1430469

Applied Artificial Intelligence (Nov 2017)

Affiliations

Elham Babaee: University of Malaya
Nor Badrul Anuar: University of Malaya
Ainuddin Wahid Abdul Wahab: University of Malaya
Shahaboddin Shamshirband: Ton Duc Thang University
Anthony T. Chronopoulos: University of Texas

DOI: https://doi.org/10.1080/08839514.2018.1430469
Journal volume & issue: Vol. 31, no. 9-10
Pages: 661–714

Abstract

Audio streams, such as news broadcasts, meeting-room recordings, and special video, comprise sound from a wide variety of sources. Detecting audio events such as speech, coughing, and gunshots is the task of intelligent audio event detection (AED). AED has drawn substantial attention across applications such as security, speech recognition, speaker recognition, home care, and health monitoring, which has motivated extensive research on the topic. Deploying AED is a more complicated task than merely highlighting audio events, because feature extraction and classification must be handled so as to select the best features with high detection accuracy. To date, a wide range of detection systems based on intelligent techniques have been used to build machine learning-based audio event detection schemes. Nevertheless, previous studies do not include a state-of-the-art review of the proficiency and significance of such methods for solving audio event detection problems. The major contribution of this work is to review and categorize existing AED schemes into preprocessing, feature extraction, and classification methods. The study also analyzes the importance of the algorithms and methodologies, along with their proficiency and limitations. It further critically compares audio detection methods and algorithms in terms of accuracy and false alarms on different types of datasets.

Published in Applied Artificial Intelligence

ISSN: 0883-9514 (Print); 1087-6545 (Online)
Publisher: Taylor & Francis Group
Country of publisher: United Kingdom
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science; Science: Science (General): Cybernetics
Website: https://www.tandfonline.com/journals/uaai
