Sound Classification Using Convolutional Neural Network and Tensor Deep Stacking Network

Aditya Khamparia; Deepak Gupta; Nhu Gia Nguyen; Ashish Khanna; Babita Pandey; Prayag Tiwari

doi:10.1109/ACCESS.2018.2888882

IEEE Access (Jan 2019)

Affiliations

Aditya Khamparia: School of Computer Science and Engineering, Lovely Professional University, Phagwara, India
Deepak Gupta: Maharaja Agrasen Institute of Technology, New Delhi, India
Nhu Gia Nguyen: Graduate School, Computer Science, Duy Tan University, Da Nang, Vietnam
Ashish Khanna: Maharaja Agrasen Institute of Technology, New Delhi, India
Babita Pandey: Department of Computer and Information Technology, Babasaheb Bhimrao Ambedkar University, Lucknow, India
Prayag Tiwari: Department of Information Engineering, University of Padova, Padua, Italy

DOI: https://doi.org/10.1109/ACCESS.2018.2888882
Journal volume & issue: Vol. 7, pp. 7717–7727

Abstract

Sound plays an important role in every aspect of human life. From personal security to critical surveillance, sound is a key element in developing automated systems for these fields. A few systems are already on the market, but their efficiency remains a concern for deployment in real-life scenarios. The learning capabilities of deep learning architectures can be used to develop sound classification systems that overcome the efficiency issues of traditional systems. Our aim in this paper is to use deep learning networks to classify environmental sounds based on spectrograms generated from these sounds. We used the spectrogram images of environmental sounds to train a convolutional neural network (CNN) and a tensor deep stacking network (TDSN). We used two datasets for our experiments: ESC-10 and ESC-50. Both systems were trained on these datasets; the CNN achieved accuracies of 77% and 49%, and the TDSN achieved 56% when trained on ESC-10. From this experiment, we conclude that the proposed approach, which classifies sounds from their spectrogram images, can be used efficiently to develop sound classification and recognition systems.
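The abstract does not say how the spectrograms were generated, so the following is only a minimal sketch of one common way to turn a waveform into a log-magnitude spectrogram that could then be fed to an image classifier. The function name `log_spectrogram`, the frame length, the hop size, the Hann window, and the synthetic 440 Hz test tone are all assumptions made for illustration and are not taken from the paper.

```python
import numpy as np


def log_spectrogram(signal, frame_len=1024, hop=512):
    """Return a log-magnitude spectrogram, shape (frame_len // 2 + 1, n_frames).

    Hypothetical helper: frame_len, hop, and the Hann window are
    illustrative choices, not parameters reported by the authors.
    """
    signal = np.asarray(signal, dtype=np.float64)
    if signal.size < frame_len:
        # Zero-pad short clips so that at least one full frame exists.
        signal = np.pad(signal, (0, frame_len - signal.size))
    n_frames = 1 + (signal.size - frame_len) // hop
    window = np.hanning(frame_len)
    # Slice the signal into overlapping, windowed frames.
    frames = np.stack([
        signal[i * hop:i * hop + frame_len] * window
        for i in range(n_frames)
    ])
    # Magnitude of the real FFT of each frame: (n_frames, frame_len // 2 + 1).
    magnitude = np.abs(np.fft.rfft(frames, axis=1))
    # Convert to decibels; the small constant avoids log(0).
    return 20.0 * np.log10(magnitude + 1e-10).T


# Synthetic stand-in for a recording: one second of a 440 Hz tone at 44.1 kHz.
sample_rate = 44100
t = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 440.0 * t)
spec = log_spectrogram(tone)
```

Each column of `spec` is one time frame and each row is one frequency bin, so the array can be rendered or stored as a grayscale image. A real pipeline would load audio clips from ESC-10 or ESC-50 instead of the synthetic tone.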

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639
