Sampling the user controls in neural modeling of audio devices

Otto Mikkonen; Alec Wright; Vesa Välimäki

doi:10.1186/s13636-024-00347-5

EURASIP Journal on Audio, Speech, and Music Processing (May 2024)

Sampling the user controls in neural modeling of audio devices

Otto Mikkonen,
Alec Wright,
Vesa Välimäki

Affiliations

Otto Mikkonen: Acoustics Laboratory, Department of Information and Communications Engineering, Aalto University
Alec Wright: Acoustics Laboratory, Department of Information and Communications Engineering, Aalto University
Vesa Välimäki: Acoustics Laboratory, Department of Information and Communications Engineering, Aalto University

DOI: https://doi.org/10.1186/s13636-024-00347-5
Journal volume & issue: Vol. 2024, no. 1
pp. 1 – 13

Abstract

Read online

Abstract This work studies neural modeling of nonlinear parametric audio circuits, focusing on how the diversity of settings of the target device user controls seen during training affects network generalization. To study the problem, a large corpus of training datasets is synthetically generated using SPICE simulations of two distinct devices, an analog equalizer and an analog distortion pedal. A proven recurrent neural network architecture is trained using each dataset. The difference in the datasets is in the sampling resolution of the device user controls and in their overall size. Based on objective and subjective evaluation of the trained models, a sampling resolution of five for the device parameters is found to be sufficient to capture the behavior of the target systems for the types of devices considered during the study. This result is desirable, since a dense sampling grid can be impractical to realize in the general case when no automated way of setting the device parameters is available, while collecting large amounts of data using a sparse grid only incurs small additional costs. Thus, the result provides guidance for efficient collection of training data for neural modeling of other similar audio devices.

Published in EURASIP Journal on Audio, Speech, and Music Processing

ISSN: 1687-4722 (Online)
Publisher: SpringerOpen
Country of publisher: United Kingdom
LCC subjects: Science: Physics: Acoustics. Sound; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://asmp-eurasipjournals.springeropen.com

About the journal

Abstract

Keywords