Dynamically localizing multiple speakers based on the time-frequency domain

Hodaya Hammer; Shlomo E. Chazan; Jacob Goldberger; Sharon Gannot

doi:10.1186/s13636-021-00203-w

EURASIP Journal on Audio, Speech, and Music Processing (Apr 2021)

Dynamically localizing multiple speakers based on the time-frequency domain

Hodaya Hammer,
Shlomo E. Chazan,
Jacob Goldberger,
Sharon Gannot

Affiliations

Hodaya Hammer: Faculty of Electrical Engineering
Shlomo E. Chazan: Faculty of Electrical Engineering
Jacob Goldberger: Faculty of Electrical Engineering
Sharon Gannot: Faculty of Electrical Engineering

DOI: https://doi.org/10.1186/s13636-021-00203-w
Journal volume & issue: Vol. 2021, no. 1
pp. 1 – 10

Abstract

Read online

Abstract In this study, we present a deep neural network-based online multi-speaker localization algorithm based on a multi-microphone array. Following the W-disjoint orthogonality principle in the spectral domain, time-frequency (TF) bin is dominated by a single speaker and hence by a single direction of arrival (DOA). A fully convolutional network is trained with instantaneous spatial features to estimate the DOA for each TF bin. The high-resolution classification enables the network to accurately and simultaneously localize and track multiple speakers, both static and dynamic. Elaborated experimental study using simulated and real-life recordings in static and dynamic scenarios demonstrates that the proposed algorithm significantly outperforms both classic and recent deep-learning-based algorithms. Finally, as a byproduct, we further show that the proposed method is also capable of separating moving speakers by the application of the obtained TF masks.

Published in EURASIP Journal on Audio, Speech, and Music Processing

ISSN: 1687-4722 (Online)
Publisher: SpringerOpen
Country of publisher: United Kingdom
LCC subjects: Science: Physics: Acoustics. Sound; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://asmp-eurasipjournals.springeropen.com

About the journal

Abstract

Keywords