Sound field reconstruction using neural processes with dynamic kernels

Zining Liang; Wen Zhang; Thushara D. Abhayapala

doi:10.1186/s13636-024-00333-x

EURASIP Journal on Audio, Speech, and Music Processing (Feb 2024)

Affiliations

Zining Liang: Center of Intelligent Acoustics and Immersive Communications, School of Marine Science and Technology, Northwestern Polytechnical University
Wen Zhang: Center of Intelligent Acoustics and Immersive Communications, School of Marine Science and Technology, Northwestern Polytechnical University
Thushara D. Abhayapala: Audio and Acoustic Signal Processing Group, College of Engineering and Computer Science, The Australian National University

Journal volume & issue: Vol. 2024, no. 1
pp. 1 – 15

Abstract

Accurately representing the sound field with high spatial resolution is crucial for immersive and interactive sound field reproduction technology. Recent studies have placed notable emphasis on efficiently estimating sound fields from a limited number of discrete observations. In particular, kernel-based methods have been proposed that use Gaussian processes (GPs) with a covariance function to model spatial correlations. However, current methods rely on pre-defined kernels, which requires manually identifying the optimal kernel and its parameters for each sound field. In this work, we propose a novel approach that parameterizes GPs using a deep neural network based on neural processes (NPs) to reconstruct the magnitude of the sound field. This method dynamically learns kernels from data using an attention mechanism, allowing for greater flexibility and adaptability to the acoustic properties of the sound field. Numerical experiments demonstrate that the proposed approach outperforms current methods in reconstruction accuracy, providing a promising alternative for sound field reconstruction.
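
To make the contrast in the abstract concrete, the following is a minimal sketch and not the authors' implementation. It compares a GP posterior mean computed with a pre-defined kernel against a scaled dot-product attention estimate whose weights play the role of a data-dependent kernel. The RBF kernel, the toy magnitude function, the microphone positions, and the projection matrix `W` are all assumptions introduced for illustration; `W` is random here, whereas in a neural process it would be learned from data.

```python
import numpy as np

def rbf_kernel(a, b, length_scale=0.5):
    # Pre-defined kernel (assumption: RBF); length_scale must be chosen by hand.
    d2 = ((a[:, None, :] - b[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * d2 / length_scale**2)

def gp_posterior_mean(x_obs, y_obs, x_new, length_scale=0.5, noise=1e-3):
    # Standard GP regression mean: k(x*, X) (K + noise * I)^-1 y
    K = rbf_kernel(x_obs, x_obs, length_scale) + noise * np.eye(len(x_obs))
    k_star = rbf_kernel(x_new, x_obs, length_scale)
    return k_star @ np.linalg.solve(K, y_obs)

def attention_estimate(x_obs, y_obs, x_new, W):
    # Scaled dot-product attention: target positions are queries,
    # observed positions are keys, observed magnitudes are values.
    q = x_new @ W
    k = x_obs @ W
    scores = q @ k.T / np.sqrt(W.shape[1])
    scores -= scores.max(axis=1, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=1, keepdims=True)  # each row sums to 1
    return weights @ y_obs, weights

rng = np.random.default_rng(0)
x_obs = rng.uniform(0.0, 1.0, size=(8, 2))  # hypothetical microphone positions
y_obs = np.abs(np.sin(4.0 * x_obs[:, 0]) * np.cos(3.0 * x_obs[:, 1]))  # toy magnitudes
x_new = rng.uniform(0.0, 1.0, size=(5, 2))  # positions to reconstruct
W = rng.normal(size=(2, 4))  # hypothetical projection, untrained

gp_mean = gp_posterior_mean(x_obs, y_obs, x_new)
att_mean, att_weights = attention_estimate(x_obs, y_obs, x_new, W)
```

In the GP branch the similarity between positions is fixed by the kernel family and its length scale. In the attention branch it is determined by `W`, so training `W` on example sound fields would let the similarity measure adapt to the data.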

Published in EURASIP Journal on Audio, Speech, and Music Processing

ISSN: 1687-4722 (Online)
Publisher: SpringerOpen
Country of publisher: United Kingdom
LCC subjects: Science: Physics: Acoustics. Sound; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://asmp-eurasipjournals.springeropen.com
