A Spiking Neural Network Framework for Robust Sound Classification

Jibin Wu; Yansong Chua; Malu Zhang; Haizhou Li; Kay Chen Tan

doi:10.3389/fnins.2018.00836

Frontiers in Neuroscience (Nov 2018)

Affiliations

Jibin Wu: Department of Electrical and Computer Engineering, National University of Singapore, Singapore, Singapore
Yansong Chua: Institute for Infocomm Research, A*STAR, Singapore, Singapore
Malu Zhang: Department of Electrical and Computer Engineering, National University of Singapore, Singapore, Singapore
Haizhou Li: Department of Electrical and Computer Engineering, National University of Singapore, Singapore, Singapore; Institute for Infocomm Research, A*STAR, Singapore, Singapore
Kay Chen Tan: Department of Computer Science, City University of Hong Kong, Kowloon Tong, Hong Kong

Journal volume & issue: Vol. 12

Abstract

Environmental sounds form part of our daily life. With the advancement of deep learning models and the abundance of training data, the performance of automatic sound classification (ASC) systems has improved significantly in recent years. However, the high computational cost, and hence high power consumption, remains a major hurdle for large-scale implementation of ASC systems on mobile and wearable devices. Motivated by the observation that humans are highly effective and consume little power while analyzing complex audio scenes, we propose a biologically plausible ASC framework, namely SOM-SNN. This framework uses an unsupervised self-organizing map (SOM) to represent the frequency content of acoustic signals, followed by an event-based spiking neural network (SNN) for spatiotemporal spiking pattern classification. We report experimental results on the RWCP environmental sound and TIDIGITS spoken digits datasets, which demonstrate classification accuracies competitive with other deep learning and SNN-based models. The SOM-SNN framework is also shown to be highly robust to corrupting noise after multi-condition training, whereby the model is trained with noise-corrupted sound samples. Moreover, we discover the early decision-making capability of the proposed framework: an accurate classification can be made with only a partial presentation of the input.

Published in Frontiers in Neuroscience

ISSN: 1662-4548 (Print); 1662-453X (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Medicine: Internal medicine: Neurosciences. Biological psychiatry. Neuropsychiatry
Website: http://www.frontiersin.org/neuroscience
