Automatic Spatial Audio Scene Classification in Binaural Recordings of Music

Sławomir K. Zieliński; Hyunkook Lee

doi:10.3390/app9091724

Applied Sciences (Apr 2019)

Affiliations

Sławomir K. Zieliński: Faculty of Computer Science, Białystok University of Technology, 15-351 Białystok, Poland
Hyunkook Lee: Applied Psychoacoustics Laboratory (APL), University of Huddersfield, Huddersfield HD1 3DH, UK

DOI: https://doi.org/10.3390/app9091724
Journal volume & issue: Vol. 9, no. 9, p. 1724

Abstract

The aim of the study was to develop a method for the automatic classification of three spatial audio scenes, differing in the horizontal distribution of foreground and background audio content around a listener, in binaurally rendered recordings of music. For the purpose of the study, audio recordings were synthesized using thirteen sets of binaural room impulse responses (BRIRs), representing the room acoustics of both semi-anechoic and reverberant venues. Head movements were not considered in the study. The proposed method was assumption-free with regard to the number and characteristics of the audio sources. A least absolute shrinkage and selection operator (LASSO) was employed as a classifier. According to the results, it is possible to automatically identify the spatial scenes using a combination of binaural and spectro-temporal features. The method exhibits satisfactory classification accuracy when it is trained and then tested on different stimuli synthesized using the same BRIRs (accuracy ranging from 74% to 98%), even in highly reverberant conditions. However, the generalizability of the method needs to be further improved. This study demonstrates that, in addition to the binaural cues, the Mel-frequency cepstral coefficients constitute an important carrier of spatial information, imperative for the classification of spatial audio scenes.
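
The abstract does not say which binaural cues were extracted. As a rough illustration only, the sketch below computes two commonly used cues, the interaural level difference (ILD) and the interaural time difference (ITD), from a pair of ear signals. The function names `ild_db` and `itd_samples` and the toy signals are hypothetical and are not taken from the paper; this is not the authors' feature extractor.

```python
import math

def ild_db(left, right, eps=1e-12):
    """Interaural level difference in dB (positive: left ear is louder)."""
    e_left = sum(x * x for x in left)
    e_right = sum(x * x for x in right)
    return 10.0 * math.log10((e_left + eps) / (e_right + eps))

def itd_samples(left, right, max_lag):
    """Lag in samples that maximizes the interaural cross-correlation.

    Positive lag: the right-ear signal is delayed relative to the left.
    Assumes both signals have the same length.
    """
    n = len(left)
    best_lag, best_val = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        # Correlate left[i] with right[i + lag] over the valid overlap.
        val = sum(left[i] * right[i + lag]
                  for i in range(max(0, -lag), min(n, n - lag)))
        if val > best_val:
            best_lag, best_val = lag, val
    return best_lag

# Toy ear signals (assumed, for illustration): a short pulse at the left ear,
# and the same pulse delayed by 3 samples and halved in amplitude at the right.
left = [0.0] * 32
left[10:14] = [1.0, -0.8, 0.5, -0.2]
right = [0.0] * 3 + [0.5 * x for x in left[:-3]]

ild = ild_db(left, right)           # about 6.02 dB (amplitude ratio of 2)
itd = itd_samples(left, right, 8)   # 3 samples
```

In a pipeline like the one the abstract describes, cues of this kind would be computed per frame and frequency band, combined with spectro-temporal features such as Mel-frequency cepstral coefficients, and passed to the classifier.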

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci
