Self-Supervised Transfer Learning from Natural Images for Sound Classification

Sungho Shin; Jongwon Kim; Yeonguk Yu; Seongju Lee; Kyoobin Lee

doi:10.3390/app11073043

Applied Sciences (Mar 2021)

Self-Supervised Transfer Learning from Natural Images for Sound Classification

Sungho Shin,
Jongwon Kim,
Yeonguk Yu,
Seongju Lee,
Kyoobin Lee

Affiliations

Sungho Shin: School of Integrated Technology, Gwangju Institute of Science and Technology, Gwangju 61005, Korea
Jongwon Kim: School of Integrated Technology, Gwangju Institute of Science and Technology, Gwangju 61005, Korea
Yeonguk Yu: School of Integrated Technology, Gwangju Institute of Science and Technology, Gwangju 61005, Korea
Seongju Lee: School of Integrated Technology, Gwangju Institute of Science and Technology, Gwangju 61005, Korea
Kyoobin Lee: School of Integrated Technology, Gwangju Institute of Science and Technology, Gwangju 61005, Korea

DOI: https://doi.org/10.3390/app11073043
Journal volume & issue: Vol. 11, no. 7
p. 3043

Abstract

Read online

We propose the implementation of transfer learning from natural images to audio-based images using self-supervised learning schemes. Through self-supervised learning, convolutional neural networks (CNNs) can learn the general representation of natural images without labels. In this study, a convolutional neural network was pre-trained with natural images (ImageNet) via self-supervised learning; subsequently, it was fine-tuned on the target audio samples. Pre-training with the self-supervised learning scheme significantly improved the sound classification performance when validated on the following benchmarks: ESC-50, UrbanSound8k, and GTZAN. The network pre-trained via self-supervised learning achieved a similar level of accuracy as those pre-trained using a supervised method that require labels. Therefore, we demonstrated that transfer learning from natural images contributes to improvements in audio-related tasks, and self-supervised learning with natural images is adequate for pre-training scheme in terms of simplicity and effectiveness.

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci

About the journal

Abstract

Keywords