VI-PANN: Harnessing Transfer Learning and Uncertainty-Aware Variational Inference for Improved Generalization in Audio Pattern Recognition

John Fischer; Marko Orescanin; Eric Eckstrand

doi:10.1109/ACCESS.2024.3372423

IEEE Access (Jan 2024)

VI-PANN: Harnessing Transfer Learning and Uncertainty-Aware Variational Inference for Improved Generalization in Audio Pattern Recognition

John Fischer,
Marko Orescanin,
Eric Eckstrand

Affiliations

John Fischer: ORCiD; Department of Computer Science, Naval Postgraduate School, Monterey, CA, USA
Marko Orescanin: ORCiD; Department of Computer Science, Naval Postgraduate School, Monterey, CA, USA
Eric Eckstrand: ORCiD; Data Science and Analytics Group, Naval Postgraduate School, Monterey, CA, USA

DOI: https://doi.org/10.1109/ACCESS.2024.3372423
Journal volume & issue: Vol. 12
pp. 33347 – 33360

Abstract

Read online

Transfer learning (TL) is an increasingly popular approach to training deep learning (DL) models that leverages the knowledge gained by training a foundation model on diverse, large-scale datasets for use on downstream tasks where less domain- or task-specific data is available. The literature is rich with TL techniques and applications; however, the bulk of the research makes use of deterministic DL models which are often uncalibrated and lack the ability to communicate a measure of epistemic (model) uncertainty in prediction. Unlike their deterministic counterparts, Bayesian DL (BDL) models are often well-calibrated, provide access to epistemic uncertainty for a prediction, and are capable of achieving competitive predictive performance. In this study, we propose variational inference pre-trained audio neural networks (VI-PANNs). VI-PANNs are a variational inference variant of the popular ResNet-54 architecture which are pre-trained on AudioSet, a large-scale audio event detection dataset. We evaluate the quality of the resulting uncertainty when transferring knowledge from VI-PANNs to other downstream acoustic classification tasks using the ESC-50, UrbanSound8K, and DCASE2013 datasets. We demonstrate, for the first time, that it is possible to transfer calibrated uncertainty information along with knowledge from upstream tasks to enhance a model’s capability to perform downstream tasks.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords