Improving the Robustness of DTW to Global Time Warping Conditions in Audio Synchronization

Jittisa Kraprayoon; Austin Pham; Timothy J. Tsai

doi:10.3390/app14041459

Applied Sciences (Feb 2024)

Improving the Robustness of DTW to Global Time Warping Conditions in Audio Synchronization

Jittisa Kraprayoon,
Austin Pham,
Timothy J. Tsai

Affiliations

Jittisa Kraprayoon: Department of Computer Science, Columbia University, New York, NY 10027, USA
Austin Pham: SEAS Columbia Engineering—Computer Science, Columbia University, New York, NY 10027, USA
Timothy J. Tsai: Department of Engineering, Harvey Mudd College, Claremont, CA 91711, USA

DOI: https://doi.org/10.3390/app14041459
Journal volume & issue: Vol. 14, no. 4
p. 1459

Abstract

Read online

Dynamic time warping estimates the alignment between two sequences and is designed to handle a variable amount of time warping. In many contexts, it performs poorly when confronted with two sequences of different scale, in which the average slope of the true alignment path in the pairwise cost matrix deviates significantly from one. This paper investigates ways to improve the robustness of DTW to such global time warping conditions, using an audio–audio alignment task as a motivating scenario of interest. We modify a dataset commonly used for studying audio–audio synchronization in order to construct a benchmark in which the global time warping conditions are carefully controlled, and we evaluate the effectiveness of several strategies designed to handle global time warping. Among the strategies tested, there is a clear winner: performing sequence length normalization via downsampling before invoking DTW. This method achieves the best alignment accuracy across a wide range of global time warping conditions, and it maintains or reduces the runtime compared to standard usages of DTW. We present experiments and analyses to demonstrate its effectiveness in both controlled and realistic scenarios.

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci

About the journal

Abstract

Keywords